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ABSTRACT 

Theoretical arguments and indirect observational evidence suggest that the stellar initial mass function (IMF) 
may evolve with time, such that it is more weighted toward high mass stars at higher redshift. Here we test this 
idea by comparing the rate of luminosity evolution of massive early-type galaxies in clusters at 0.02 < z < 0.83 
to the rate of their color evolution. A combined fit to the rest-frame U -V color evolution and the previously 
measured evolution of the M/Lb ratio gives x = -0.3;!;q7 for the logarithmic slope of the IMF in the region 
around IMq, significantly flatter than the present-day value in the Milky Way disk of x = L3 ±0.3. The 
best-fitting luminosity-weighted formation redshift of the stars in massive cluster galaxies is 3.7^q g, and a 
possible interpretation is that the characteristic mass rric had a value of ^ 2Mq at z ^ 4 (compared to nic ^ 
0. 1 Mq today), in qualitative agreement with models in which the characteristic mass is a function of the Jeans 
mass in molecular clouds. Such a "bottom-light" IMF for massive cluster galaxies has significant implications 
for the interpretation of measurements of galaxy formation and evolution. Applying a simple form of IMF 
evolution to literature data, we find that the volume-averaged star formation rate at high redshift may have been 
overestimated (by a factor of 3 -4 at z > 4), and the cosmic star formation history may have a fairly well-defined 
peak at z L5. The M/Ly ratios of galaxies are less affected than their star formation rates, and future data 
on the stellar mass density at z > 3 will provide further constraints on IMF evolution. The formal errors likely 
underestimate the uncertainties, and confirmation of these results requires a larger sample of clusters and the 
inclusion of redder rest-frame colors in the analysis. 

Subject headings: galaxies: high-redshift — galaxies: evolution — galaxies: elliptical and lenticular, cD — 
galaxies: stellar content — stars: luminosity function, mass function 



1. INTRODUCTION 

The form of the stellar initial mass function (IMF) is of 
fundamental importance for many areas of astrophysics and a 
topic of considerable debate (see, e.g., Schmidt 1959; Miller 
& Scalo 1979; Scalo 1986; Larson 1998, 2003; Ki-oupa 2002; 
Chabrier 2003, for reviews). Measurements of the IMF are 
difficult and somewhat model-dependent as they require the 
conversion of the observed present-day luminosity function 
of a stellar population to its mass function at birth. Best es- 
timates for the Galactic disk suggest that the IMF has a pow- 
erlaw slope at m > IMq, and turns over at lower masses 
(Kroupa 2001 ; Chabrier 2003). This turnover can be modeled 
by a broken powerlaw (Kroupa 2001) or by a log-normal dis- 
tribution with a characteristic mass nic (Chabrier 2003). The 
value of nic is ~ 0.1 Mq in the disk of the Milky Way, with 
considerable uncertainty. The powerlaw slope at high masses 
is probably close to the Salpeter (1955) value of ;c = 1 .35, with 
an uncertainty of - 0.3 (Scalo 1986; Chabrier 2003). 

Although there is no direct evidence for dramatic varia- 
tions of the IMF within the present-day Milky Way disk (e.g., 
Kroupa 2001; Chabrier 2003), this does not preclude varia- 
tions with time, metallicity, and/or environment. In partic- 
ular, Larson (1998, 2005) has argued that the characteristic 
turnover mass may be largely determined by the thermal Jeans 
mass, which strongly depends on temperature (oc T^^^ at fixed 
density). In the context of this model one might expect that 
heating by ambient far-infrared radiation would disfavor the 
formation of low mass stars in extreme environments, such as 
in super star clusters and in the center of the Milky Way. Other 

' Based on observations with the NASA/ESA Hubble Space Telescope, 
obtained at the Space Telescope Science Institute, which is operated by 
AURA, Inc., under NASA contract NAS 5-26555. 



models emphasize the role of turbulence as opposed to tem- 
perature in determining the distribution of protostellar clumps 
(e.g., Padoan & Nordlund 2002), and in these models the role 
of the environment may be less direct (see McKee & Ostriker 
[2007] for a recent review of various models to explain the 
characteristics of the IMF). 

Observations may support the notion of a top-heavy (or 
"bottom-light") IMF in extreme environments. Some young 
super star clusters in M82 appear to have a top-heavy mass 
function (e.g., Rieke et al. 1993; McCrady, Gilbert, & Gra- 
ham 2003), as do clusters in the Galactic center region (e.g., 
Figer et al. 1999; Stolte et al. 2005; Maness et al. 2007). 
The interpretation of observed mass functions is complicated 
by dynamical effects, which tend to make the mass function 
more top-heavy over time, in particular in the central regions 
of star clusters (see, e.g., McCrady et al. 2003; McCrady, Gra- 
ham, & Vacca 2005; Kim et al. 2006). Recently Harayama, 
Eisenhauer, & Martins (2007) studied the IMF of NGC 3603, 
one of the most massive Galactic star-forming regions, out to 
large radii and conclude that its IMF is substantially flatter 
than Salpeter for masses 0.4-20Mq. 

The IMF may also depend on redshift. At earlier times star 
formation presumably occurred more often in a burst mode 
than in a relatively gradual "disk" mode (e.g., Steidel et al. 
1996; Blain et al. 1999b; Lacey et al. 2007), which means 
that the IMF could generally be more skewed toward high 
mass stars at redshifts 1-3 and beyond. Furthermore, the av- 
erage metallicity in star forming clouds was lower at higher 
redshift, which may have led to an extremely top-heavy IMF 
for the first generation of stars (e.g., Abel, Bryan, & Norman 
2002; Bromm, Coppi, & Larson 2002). Finally, the cosmic 
microwave background (CMB) radiation sets a floor to the 
ambient temperature, and hence the Jeans mass, which scales 
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with (1+z). Beyond z ~ 2 the CMB temperature exceeds 
the typical temperatures of dense prestellar cores in Galactic 
molecular clouds (e.g. Evans et al. 2001; Tafalla et al. 2004). 
Therefore, at sufficiently high redshift the characteristic mass 
may be expected to evolve roughly as nic oc leading 
to IMFs which have a reduced fraction of low mass stars (Lar- 
son 1998). The effects of the CMB are even more pronounced 
when its influence on the pressure in star-forming clouds is 
taken into account, and Larson (2005) suggests that at z = 5 
the characteristic mass may be higher than today's value by 
as much as an order of magnitude. Such rapid evolution of 
the IMF would have important consequences for determina- 
tions of masses and star formation rates of distant galaxies, 
and for measurements of evolution in these properties. 

It is very difficult to constrain the IMF at early times di- 
rectly, as the light of high redshift galaxies is completely dom- 
inated by massive stars. The extremely blue rest-frame UV 
colors of galaxies at z ^ 6 may imply a top-heavy IMF (Stan- 
way, McMahon, & Bunker 2005), although this is just one of 
several possible explanations. From observations of a lensed 
Lyman break galaxy there is some evidence that the slope of 
the IMF at z 3 is similar to the Salpeter value at the high 
mass end (Pettini et al. 2000), but there is essentially no in- 
formation on stars with masses near or below 1 Mq. 

Fortunately, the form of the high redshift IMF has implica- 
tions for the properties of galaxies at much lower redshift, as 
all stars with masses < 0.8 Mq that formed in the history of 
the Universe are still with us today. Tumlinson (2007) finds 
that the properties of carbon-enhanced metal-poor stars in our 
Galaxy are best explained with a relatively high number of 
stars in the mass range 1-8 Mq at high redshift. Various other 
constraints obtained from galaxies at low redshift (including 
our own) are reviewed in Chabrier (2003). Of particular in- 
terest are the stellar populations of massive early-type galax- 
ies, as they are very homogeneous and should reflect condi- 
tions in star forming regions at z > 2. Recently, Cappellari 
et al. (2006) used the kinematics of elliptical galaxies to con- 
strain the IMF, as the dynamical M/L ratio provides an up- 
per limit to the amount of mass that can be locked up in low 
mass stars. Current data appear to rule out a Salpeter (1955) 
(or steeper) IMF, but are consistent with Kroupa (2001) and 
Chabrier (2003) IMFs (Cappellari et al. 2006). 

In this paper, we provide new constraints on the IMF at 
high redshift by comparing the evolution of the M/L ratios 
of early-type galaxies to their color evolution. This method 
was first suggested by Tinsley (1980), but data of sufficient 
accuracy are only now becoming available. The method is 
sensitive to the IMF in the important mass range around 1 Mq, 
where the effects of an evolving characteristic mass might be 
expected to manifest themselves. 

A plan of the paper follows. In § 2 a relation between color 
evolution, luminosity evolution, and the logarithmic slope of 
the IMFx is derived using stellar population synthesis models. 
In § 3 published data and archival HST images of galaxy clus- 
ters are used to construct the redshift evolution of the U -V 
color-mass relation. In § 4 the color evolution from § 3 is com- 
bined with the previously measured evolution of the M /Lb ra- 
tio. The relations from § 2 are then used to derive constraints 
on the IMF slope x from the combined color and luminosity 
evolution. Section 5 is devoted to the (many) systematic un- 
certainties in the methodology and in the data, and § 6 asks 
whether our results are consistent with other constraints on 
the stellar populations of massive early-type galaxies. Al- 



though the constraints we derive in this paper are subject to 
many uncertainties, it is interesting to explore their conse- 
quences. In § 7 the fitting results of § 4 are interpreted in the 
context of an evolving characteristic mass nic- The data on 
cluster galaxies are combined with previous constraints on nic 
for globular clusters and submm-galaxies, and a simple form 
of IMF evolution is proposed. This evolution is then applied 
to literature data on the evolution of the volume-averaged star 
formation rate and stellar mass density. The key results are 
summarized in §8. We assume il,„ = 0.3, JIa = 0.7, and 
Ho = 71 km s"' Mpc"' where needed. 

2. METHODOLOGY 

2. 1 . Effects of the IMF on Luminosity and Color Evolution 

The form of the IMF has a strong effect on the evolution 
of the M/L ratio of galaxies. The luminosity of a stellar pop- 
ulation is expected to evolve even in the absence of star for- 
mation or mergers, due to the fact that stars turn off the main 
sequence and die. The rate of luminosity evolution in a given 
passband is determined by a combination of the lifetime of 
stars near the turn-off and the number of these stars. The life- 
time of turn-off stars is directly determined by the age of the 
stellar population (with the lifetime approximately equal to 
the age), and the number of turn-off stars is directly deter- 
mined by the IMF. A flat, top-heavy IMF implies a relatively 
large number of short-lived massive stars and rapid luminos- 
ity evolution, whereas a steep IMF implies a relatively large 
number of long-lived low mass stars and slow evolution. 

The color evolution behaves differently, as it is insensitive 
to the number of stars that evolves off the main sequence over 
time. Instead it is mainly driven by the location of the turn- 
off in the Herzsprung-Russell (HR) diagram, which does not 
depend on the IMF but is determined by the age of the stel- 
lar population. The IMF does influence the color evolution to 
some extent, as a top-heavy IMF reduces the number of turn- 
off stars with respect to the more luminous red giants. The lo- 
cation of the giant branch evolves much less than the location 
of the turn-off, and a flat IMF therefore implies weaker color 
evolution than a steep IMF. As first pointed out by Tinsley 
(1980), the IMF thus has opposite effects on the luminosity 
and color evolution: a more top-heavy IMF leads to stronger 
luminosity evolution and weaker color evolution. 

2.2. Stellar Population Synthesis Models 

Following Tinsley (1980) and many later studies, we pa- 
rameterize luminosity and color evolution by power laws. 
Power law approximations render the predicted evolution in- 
dependent of the Hubble constant and the absolute age cali- 
bration of the model, and smooth out artifacts in the evolution 
of single-age stellar populations caused by numerical effects. 
In our passbands (see §[3]i luminosity evolution takes the form 

logM/Lfi = Kb log(r - tform) + Ci ( 1 ) 

and color evolution takes the form 

U-V = 2.5k,u-v log(f - ffo™) + C2, (2) 

with ffoim the luminosity-weighted mean star formation epoch 
and Ci and C2 constants. 

We use the Maraston (2005) stellar population synthesis 
models to determine kb and k^-v- Fig.lTJa) and (b) show the 
evolution of the M /Lb ratio and U-V color in this model for 
three different values of x (1.3, 0.7, 0). These IMFs are power 
laws with a fixed slope over the mass range 0.1- 100 M©. The 
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Fig. 1 . — Predictions of luminosity evolution (a) and color evolution (b) for three different values of the logarithmic slope of the IMF x, from 
Maraston (2005) models with [Fe/H] = 0.35 and ages of 3- 12 Gyr. Luminosity evolution is plotted as a function of color evolution in (c). 
Broken lines are power law fits to the model predictions. Panels (d-f) show the power law coefficients obtained for the three different values of 
X. Lines are linear fits to the points. The dotted line in (f) is the relation implied by the fits in (d) and (e). Note the strong IMF dependence of 
luminosity evolution and the weak (and opposite) IMF dependence of color evolution. Open symbols and dashed lines show results for models 
with [Fe/H] = 0. The color evolution, and hence also the ratio of luminosity to color evolution, is quite dependent on the assumed metallicity. 



predictions are for a metallicity [Z/H] = 0.35. This relatively 
high value may be more appropriate for massive early-type 
galaxies than the Solar value (e.g., Worthey, Faber, & Gonza- 
lez 1992), although Solar models are much better calibrated. 
As expected, luminosity evolution has a strong IMF depen- 
dence but color evolution only has a weak (and opposite) IMF 
dependence. Panel (c) directly compares the evolution of the 
M/L ratio to the color evolution. The IMF dependence is rel- 
atively strong in this panel, due to the opposite effects of the 
IMF on luminosity and color evolution. 

Broken lines in panels (a-c) are power law fits to the model 
predictions. These fits generally provide good descriptions of 
the behavior of the models. The largest deviations are for the 
U-V colors at 9 and 12 Gyr, where the fits are off by +0.02 
and -0.02 mag respectively. Panels (d-f) show the relation 
between the powerlaw coefficients and the IMF slope. In each 
case there is a well-defined relation, that can be characterized 
by a simple linear fit. These fits take the form 



the fits are excellent for < jc < 1.3 the extrapolations out- 
side that range are obviously uncertain. Note that Eq. |5] is 
a fit, and is not identical to the combination of Eqs. |3] and 
m The dotted line in Fig. [Ttf) shows the relation implied 
by Eqs. |3] and |4l this relation is consistent with Eq. |5] to a 
few percent. Open symbols and dashed lines are predictions 
for models with [Fe/H] = 0. Luminosity evolution is insen- 
sitive to [Fe/H], but color evolution has a strong metallicity- 
dependence. Therefore, the relation in panel (f) also has a 
strong metallicity dependence, and takes the form 



Kb 

Ku-V 



= 6.93-1. 81x 



(6) 



Kg = 



Ku-V = 0.4 



d(log(M/LB)) 

t/(Iog(f-fform)) 

d(U-V) 

c/(log(f-fform)) 



= 1.30-0.24JC, 



0.23 + 0.016X, 



and 



Kb 



■■2.5 



d{\og{M/LB)) 



:5.44-1.23x. 



(3) 
(4) 

(5) 



Ku-V d{U-V) 
The fits are shown by the solid lines in Fig.lTJd-f). Although 



for Solar metallicity. When determining the IMF slope in §|4] 
results will be given both for super-Solar models and Solar 
models (which are better calibrated). 

2.3. Implementation 

As demonstrated in Fig. [1] the IMF mostly influences the 
luminosity evolution, and the color evolution can be viewed 
as an IMF-insensitive "clock" constraining the age. By com- 
bining color and luminosity evolution for a well-chosen set 
of galaxies over a sufficiently large redshift range the slope 
of the IMF X and the formation time of the stars ffo„n can, in 
principle, be constrained simultaneously (although there are 
many caveats - see § 5). 

The method requires very accurate rest-frame color and 
M/L measurements for a passively evolving sample of galax- 
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ies. This is not an easy task, as galaxy evolution is in gen- 
eral a complex process involving star bursts, mergers, and 
morphological changes. The most massive early-type galax- 
ies in clusters at modest redshifts (z < 1) come closest to a 
"purely passive" sample. Their colors and M /L ratios have 
very small scatter (e.g.. Bower, Lucey, & Ellis 1992a; J0r- 
gensen, Franx, & Kjsergaard 1996), their luminosity and color 
evolution are consistent with passive evolution of stellar pop- 
ulations formed at redshifts z > 2 (e.g., van Dokkum et al. 
1998a; Holden et al. 2004), and they appear to have been 
largely in place hy z~ 0.8 (Holden et al. 2006, 2007). 

Recently a large number of measurements of the M /Lg evo- 
lution of early-type galaxies with masses M > 10" Mq were 
compiled from the literature and placed on a consistent sys- 
tem (van Dokkum & van der Marel 2007) [hereafter vv07]. 
Color evolution has also been studied by many authors (e.g., 
Stanford, Eisenhardt, & Dickinson 1995; Ellis et al. 1997; 
Holden et al. 2006), but not yet for mass-limited samples and 
with the required accuracy. With the availability of archival 
multi-color //M/j/j/e Space Telescope (HST) WFPC2 and ACS 
data of a subset of the clusters in vv07 it is now possible to de- 
termine color and luminosity evolution of massive early-type 
galaxies in clusters simultaneously, and begin to constrain the 
form of the IMF at the mean star formation epoch of massive 
early-type galaxies. 

3. DATA 

In this Section, data from the literature and the HST Archive 
are used to determine the evolution of rest-frame U -V col- 
ors of galaxies in clusters from the vv07 sample. Because 
mass measurements of individual galaxies are available the 
color evolution can be determined from the zeropoint of the 
color-mass (rather than the color-magnitude) relation. Also, 
the same mass limit can be applied as was applied to the M/L 
data. We focus onU-V because redder rest-frame colors are 
not available for clusters beyond z ^ 0.5. 

3.1. The Color-Mass Relation in Coma 

The color-magnitude relation in the Coma cluster was stud- 
ied by Bower et al. (1992a,b) [hereafter BLE92]. An im- 
portant goal of their study was to determine the relative dis- 
tance between the Virgo and Coma cluster, and this required 
very accurate absolute color measurements. The data from 
BLE92 are combined with velocity dispersions and effective 
radii from J0rgensen, Franx, & Kjaergaard (1995a,b). Masses 
are calculated using 



logM = 2 log cr + log + 6 .07 , 



(7) 



with a the velocity dispersion corrected to a 3 "4 diameter 
aperture and the effective radius in kpc. Radii were con- 
verted from arcsec to kpc assuming a Hubble flow velocity of 
I'flow = 7376 ± 223 km s~' (see vv07). 

Figure |2] shows the color-mass relation of galaxies in com- 
mon between the BLE92 color-magnitude sample and the J0r- 
gensen et al. fundamental plane (FP) sample. There is a very 
clear relation, of the form 



f/-y = 0.113(logM- 11)+ 1.460. 



(8) 



The slope of the relation is determined by fitting a linear 
function to all galaxies with masses M > 10" Mq, and the 
offset is determined with the biweight statistic (Beers, Flynn, 
& Gebhardt 1990), which gives low weight to outliers. This 
relation is indicated by the solid line in Fig. |2] We note that 



the data in Fig. |2] are not entirely self-consistent, as the colors 
were determined in a different aperture (1 1") than the velocity 
dispersions (3. "4). This has implications for the interpretation 
of Eq. [8] but it has no effect on our analysis as the distant 
galaxies are treated in the same way as the Coma galaxies. 




log M (Mq) 



Fig. 2. — Color-mass relation for the Coma cluster, using U-V 
colors from BLE92 and masses determined from radii and velocity 
dispersions given by J0rgensen et al. (1995a,b). Errors in masses 
were determined from formal uncertainties given by J0rgensen et al. 
Errors in colors reflect the rms in repeat measurements as given by 
BLE92. There is a clear relation with low scatter. The line is the best 
fit for galaxies withM > lO" Mq. 

The observed scatter in the relation is very small at au = 
0.029 mag for M > 10" M©. This low scatter is remarkable: 
from repeat measurements of the same galaxies BLE92 esti- 
mate that the observational uncertainty for individual galax- 
ies is 0.037, which would imply that the intrinsic scatter in 
U-V is well below 0.03 for the most massive Coma galaxies. 
However, we note that the biweight estimator gives very little 
weight to the blue outlier at logM = 11.1, and this is some- 
what arbitrary given the small sample size. The rms scatter is 
0.05, similar to values measured for clusters at higher redshift 
(see § [34b . 

In this study we are not concerned with the scatter or slope 
of the color-mass relation but only with its zeropoint evo- 
lution. For each cluster in the sample Eq. [8] is subtracted 
from the rest-frame U-V colors of early-type galaxies with 
M > 10" Mq, and the offset A(U-V) is determined with the 
biweight estimator By construction, A{U - V ) = for the 
Coma cluster. The random uncertainty in the offset due to 
galaxy-galaxy color variations is 0.009. BLE92 give a sys- 
tematic error of 0.02 in the U-V colors, reflecting uncertain- 
ties in the U and V zeropoints and small errors due to red- 
dening and aperture corrections. Adding the random and sys- 
tematic errors in quadrature gives a combined uncertainty of 
0.022. We note that the BLE92 data are still the most accu- 
rate available for Coma, even though they are now more than 
15 years old. In fact, the comprehensive photometric studies 
by Terlevich, Caldwell, & Bower (2001) and Eisenhardt et al. 
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(2007) calibrate their photometry by matching the magnitudes 
of galaxies to the BLE92 values. 

3.2. Other Nearby Clusters 

No other nearby clusters are currently available with high 
quality U -V colors, effective radii, and velocity dispersions 
for their constituent galaxies. As an example, effective radii 
and velocity dispersions are available for the clusters in the 
J0rgensen et al. (1996) sample, but accurate U-V colors have 
not been measured. Conversely, Mcintosh et al. (2005) ana- 
lyzed the color-magnitude relation in the three nearby clusters 
Abell 85, Abell 496, and Abell 754 using the same apertures 
and methods as BLE92, but those clusters have not been the 
subject of FP studies. 

Although the offsets A(f/ - V) for the three clusters of 
Mcintosh et al. (2005) cannot be determined directly, they can 
be estimated by making use of the tight relation between rest- 
frame y band luminosity and mass in nearby clusters (e.g., 
J0rgensen et al. 1996). Fitting directly to the BLE92 V band 
magnitudes (in our cosmology) and J0rgensen et al. (1996) 
masses gives 

logM = -0.57Mv-0.78, (9) 

with an rms scatter of 0.14 in logM. Applying this relation 
to the full BLE92 sample (i.e., not just to galaxies with mea- 
sured masses) produces an offset in the color-"pseudo"mass 
relation of -0.003, almost identical to the offset derived from 
the actual color-mass relation. 

Absolute V band magnitudes and U-V colors were taken 
from Table 6 in Mcintosh et al. (2005). The {U -V)iM col- 
ors are used as they were measured using the same physical 
aperture size as used by BLE92. The V magnitudes are cor- 
rected to our cosmology and converted into pseudo-masses 
using Eq.|9] Offsets are determined by subtracting Eq.[8]from 
the observed colors and determining the biweight mean resid- 
ual. The offsets ai-e -0.050 ± 0.042, -0.061 ± 0.061, and 
-0.002 ±0.051 for Abell 85, Abell 496, and Abell 754 re- 
spectively. The uncertainties are a combination of random 
and systematic errors. The systematic errors were calcu- 
lated from the listed uncertainties in the zeropoints and k- 
corrections and a 16% uncertainty in the listed reddening 
(Schlegel, Finkbeiner, & Davis 1998).^ All three offsets are 
consistent with zero, in agreement with the analysis of Mcin- 
tosh et al. (2005) and with earlier work by Andreon (2003), 
who studied a large sample of clusters in the Sloan Digital 
Sky Survey (SDSS). The uncertainties in A([/-y) are con- 
siderably larger than for Coma due to a combination of larger 
photometric zeropoint uncertainties and larger Galactic red- 
dening toward the Mcintosh et al. (2005) clusters. 

3.3. Data for Distant Clusters 

3.3.1. Available HST Imaging and Observed Colors 

For each of the 14 distant clusters described in vv07 we 
determined whether HST data exist in the appropriate filters 
(i.e., roughly corresponding to rest-frame U and V). HST data 
are crucial because of their photometric stability and because 
the combination of seeing and color gradients makes aperture 
effects difficult to control in ground-based data. Appropriate 
data are available for the seven clusters listed in Table[T] Clus- 
ters from vv07 that are not included in the analysis are Abell 

^ Note that Mcintosh et al. hst larger systematic errors; they do not add the 
individual error contributions in quadrature and also assume a slightly larger 
error in the extinction than given by Schlegel et al. (1998). 



665 (z = 0.183), as the bluest available HST filter corresponds 
to rest-frame V ; Abell 2390 (z = 0.228), as the HST filters cor- 
respond to rest-frame B-R; 3C295 (z = 0.456), as the vv07 
sample only has one galaxy more massive than 10" Mq, ex- 
cept 3C295 itself; CL 1601H-42 (z = 0.539), as HST observa- 
tions were obtained in a single filter only; and all three clusters 
with z > 0.85 as the reddest available HST colors are much 
bluer than rest-frame U-V. 

For two of the seven clusters in Table 1 accurate colors 
have been published: Blakeslee et al. (2006) provide ACS 
measurements of galaxies in MS 1054-03 (z = 0.831) and 
RXJ0152-13 (z = 0.837). For the five remaining clusters 
the HST data were obtained from the HST archive, and re- 
duced using standard techniques. The WFPC2 images were 
processed using a combination of STSDAS tasks and custom 
scripts (see, e.g., van Dokkum et al. 1998b). For ACS the 
photometrically and astrometrically corrected "drz" files were 
obtained, and further processing was limited to the removal 
of remaining cosmic rays and some other defects. Galaxies 
in common with the vv07 samples were deconvolved with the 
SCLEAN task in the IRAF STSDAS package, using synthetic 
PSFs generated with Tiny Tim (Krist 1995). The effect of 
deconvolution on the measured colors is typically ^ 0.01 at 
z-0.2 and -0.02 at 0.5. 

Colors were measured in apertures scaled to match an 11" 
diameter aperture at the distance of the Coma cluster, to al- 
low a direct comparison with BLE92. The measurements 
were converted from instrumental counts to magnitudes on 
the Vega system using the WFPC2 and ACS zeropoints. For 
WFPC2 the zeropoints were taken from § 28. 1 of the WFPC2 
Data Handbook, which lists values for the PC chip and each 
of the wide field detectors separately. For ACS the zeropoints 
were taken from the Tables on the STScI website^, which 
include small corrections made after publication of Sirianni 
et al. (2005). For the two z ~ 0.83 clusters the colors were 
taken from Tables 1 and 2 of Blakeslee et al. (2006). A small 
aperture correction was applied to each galaxy, as Blakeslee 
et al. used apertures containing 50 % of the light rather than 
an aperture of fixed size (see Appendix A). The colors were 
transformed from the AB to the Vega system using the AB 
offsets listed in Sirianni et al. (2005). 

3.3.2. Rest-frame Colors and Systematic Errors 

A key step in the analysis is the transformation of the ob- 
served colors in the various HST filters to rest-frame U -V . 
Following van Dokkum & Franx (1996) and many later stud- 
ies (e.g., Blakeslee et al. 2006) we derive transformations of 
the form 

f/, = F2 + ai(/^i-/^2) + /3i+2.51og(l+z) (10) 
y, = F2 + a2(/^i-/^2) + /32 + 2.51og(l+z) (11) 

with U, the rest-frame U band, V, rest-frame V, Fx and F2 the 
observed HST filters (listed in Table 1), and a and /? deter- 
mined from template spectra. The rest-frame color is given 
by 

(JJ - V\ = (ai - a2)(Fi - F2) + iPi - P2) 

= ai-2(Fi-F2) + A-2 (12) 

Unless F\, F2 correspond exactly to U,, V, the values of a and 
(3 have a (modest) dependence on the assumed spectral type. 
For each cluster the appropriate values were determined in 

^ http://www.stsci.edu/hst/acs/analysis/zeropoints/ 
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TABLE 1 
Color Evolution 



Cluster 


z 


Camera 


Fi 




"1-2 


l3l-2 




A(U-V) 


0"lot 


Coma 


0.024 




U 


V 






0.036 


0.000 


0.037 


Abell 2218 


0.176 


WFPC2 


F450W 


F702W 


1.230 


-1.217 


0.021 


-0.082 


0.028 


CL 1358+62 


0.327 


ACS 


F475W 


F775W 


0.955 


-1.253 


0.022 


-0.101 


0.033 


CL 0024+ 16 


0.391 


WFPC2 


F450W 


F814W 


0.793 


-1.181 


0.024 


-0.122 


0.031 


CL 0016+16 


0.546 


WFPC2 


F555W 


F814W 


1.065 


-1.227 


0.023 


-0.165 


0.035 


MS 2053-04 


0.583 


WFPC2 


F606W 


F814W 


1.503 


-1.243 


0.035 


-0.167 


0.039 


MS 1054-03 


0.831 


ACS 


F606W 


F850LP 


0.881 


-1.176 


0.033 


-0.200 


0.038 


RXJ0152-13 


0.837 


ACS 


F625W 


F850LP 


1.035 


-1.164 


0.021 


-0.185 


0.024 



the following way. We used synthetic templates from Yi et al. 
(2003) with a large range of ages to establish the relation be- 
tween the coefficients and the observed color in the HST fil- 
ters. The Yi et al. (2003) templates were chosen because they 
may be better calibrated in the rest-frame ultra-violet than the 
Bruzual & Chariot (2003) models; at rest-frame wavelengths 
A > 3600 A the two sets of models are identical. Typically, a 
and f3 vary by less than a few percent for ages varying from 1 
-12 Gyr; the amount of variation depends largely on how well 
the observed HST filters match the rest-frame U and V bands. 
The relations between a, j3 and the observed colors Fi - F2 
were fitted with smooth functions, and the values correspond- 
ing to the median observed color of galaxies in the sample 
were adopted. Values of ai-2 = oi\-a2 and /3i_2 = /3i -/32 are 
listed in Table 1 . 

The rest-frame U -Y colors are subject to several sources of 
systematic uncertainty. The zeropoints of the HST filters are 
accurate to < 0.02 mag. The dominant source of zeropoint 
error is thought to be the absolute calibration of Vega; e.g., 
updates to the ACS zeropoints in 2006 due to a recalibration 
of Vega were < 0.017. As we are always dividing the flux 
in two ACS or WFPC2 passbands, such errors (nearly) can- 
cel. Other effects that plague absolute calibration of (particu- 
larly) WFPC2 photometry, such as variations in charge trans- 
fer efficiency and the long-short anomaly, also cancel when 
measuring colors. Therefore, we estimate that the systematic 
uncertainty of the measured colors (before correcting for red- 
dening) is 0.010 for WFPC2 and 0.015 for ACS (as it has a 
shorter calibration history). Uncertainties in the Galactic red- 
dening corrections are « 16 % of the applied values (Schlegel 
et al. 1998), which is typically w 0.01 mag for our clusters. 
This error is largest for CL0024H-16, at 0.021. 

The transformations to rest-frame colors also have some un- 
certainty, particularly if the observed HST filters are not well 
matched to rest-frame U -Y . This uncertainty was assessed 
by varying the templates used for the transformation, com- 
paring results for the Yi et al. (2003) templates, Bruzual & 
Chariot (2003) templates, and empirical templates from Cole- 
man, Wu, & Weedman (1980). For most clusters the error is 
- 0.015 mag. For MS 2053-04 and MS 1054-03, which have 
the worst match of observed and rest-frame filters among the 
seven clusters, the uncertainties are 0.025 and 0.030 respec- 
tively. We note that, once again, a potentially important sys- 
tematic effect cancels: the form of the synthetic rest-frame JJ 
and Y filters, and their AB zeropoints, is the same for all dis- 
tant clusters, which means that the distant clusters are all on 
the exact same synthetic photometric system. There may be 
a systematic offset between this system and Coma, although 
we use the same Bessell (1990) passbands as Landolt (1992). 
This possible offset was taken into account by adding a 0.03 
mag error in quadrature to the systematic uncertainty of the 



Coma offset. Assuming that the sources of systematic error 
are independent the combined error is given by 

2 2 2 2 2 

C^sys = ai-2(c^zero + <^red) + C^trafo- (13) 

The values of (Jsys are hsted in Table 1. They are generally 
small, of order 2 % - 3 %. The reliability of these numbers is 
empirically assessed below. 

3.4. Evolution of the Color-Mass Relation 

For each of the distant clusters in the sample the color-mass 
relation is constructed by combining the rest-frame U-Y col- 
ors with masses determined using Eq. |2l The required effec- 
tive radii and velocity dispersions are taken from the litera- 
ture and corrected to a consistent system (see Appendix A of 
vv07). The color-mass relations are shown in Fig. [3] Galaxies 
at higher redshift have bluer rest-frame U-Y colors, as ex- 
pected from stellar evolution and from many previous studies 
of color evolution (e.g., Stanford et al. 1995, 1998; Kodama 
etal. 1998;Holdenetal. 2004). 

The evolution of the zeropoint is quantified by subtracting 
Eq. [8]from the datapoints, and determining the center of the 
distribution of residuals with the biweight statistic. Choosing 
the average or the median rather than the biweight slightly 
changes the offset of each cluster (always well within its er- 
ror), but does not have an effect on our conclusions. We ver- 
ified that the slopes for the high-redshift clusters are consis- 
tent with the Coma slope by fitting the color-mass relation of 
all galaxies in the seven distant clusters simultaneously, after 
removing the zeropoint evolution derived below. The differ- 
ence between this slope and the adopted Coma slope is only 
0.01 ±0.02. Varying the adopted slope within these limits 
does not change the derived offsets. The analysis relies on 
the assumption that the slopes do not vary appreciably from 
cluster to cluster. The samples of galaxies with mass mea- 
surements are too small to determine the slopes of the color- 
mass relations in individual clusters, but we note that Holden 
et al. (2004) find no evidence for either evolution or cluster-to- 
cluster variation in the slope of the color-magnitude relation. 

The offsets are listed in Table 1 . For each cluster the un- 
certainty in the offset (Ttot is the quadratic sum of the system- 
atic error and the uncertainty in the biweight mean (which is 
caused by the galaxy-galaxy scatter within each cluster). For 
most clusters the random uncertainty is of the same order as 
the systematic uncertainty. The evolution of the zeropoint of 
the color-mass relation is shown in Fig. ID The evolution is 
very regular for these clusters, which is perhaps not surpris- 
ing given previous studies (e.g., Kodama et al. 1998; Andreon 
2003; Holden et al. 2006) and the small cluster-to-cluster 
scatter in the evolution of M/L ratios (see, e.g., vv07). 

The solid line in Fig.|4]is the best fitting linear function to 
the data (including the three Mcintosh et al. [2005] clusters): 
AiU-Y) = (-0.028 ± 0.018) - (0.214 ± 0.041)z. (14) 
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Fig. 3. — Evolution of the color-mass relation, as determined from HST photometry in combination with ground-based dynamical measure- 
ments. Solid symbols indicate galaxies with dynamical masses > lO" Mq, which are used for determining the evolution of the relation. The 
broken line in each panel is the best fit to the Coma cluster. Solid lines are the best fitting relations for each cluster, keeping the slope fixed but 
allowing the zeropoint to vary. The data confirm that massive cluster galaxies gradually redden with time, as expected from stellar evolution. 



The observed cluster-to-cluster scatter around this line is very 
small at 0.022 ±0.005 for all 11 clusters or 0.019 ±0.005 for 
the eight clusters with direct mass measurements. This re- 
markably small cluster-to-cluster scatter is qualitatively con- 
sistent with previous work by Andreon (2003), although we 
note that our study uses a much smaller number of clusters. 

The small measured scatter provides an external check on 
the reliability of the systematic errors derived in §[3] Fo- 
cusing on the eight clusters with direct mass measurements, 
the expected scatter from measurement errors alone is 0.032, 
higher than the observed scatter of 0.019. Assuming the er- 
rors are correct the probability of measuring a scatter < 0.019 
is only 7 %, which means the systematic errors are much 
more likely to be overestimated than underestimated. A for- 
mal limit on the systematic error can be derived by requiring 
that the observed scatter of 0.019 has a one-sided probability 
> 5 %, and parameterizing the total error for each cluster by 
i^tot = '''random "•"■^^"'sys' ^Ith F the Same for all clusters. From 
Monte-Carlo simulations we find that Q <F < 1.12, i.e., the 
systematic errors are consistent with zero and underestimated 
by at most 12 %. 



Most studies of the color-magnitude relation at high red- 
shift have focused on the galaxy-galaxy scatter in the relation. 
Although this is not a focus of the present paper, we can ro- 
bustly measure the scatter for galaxies with M > 10" Mq by 
removing the average trend (Eq.[T4b and combining the galax- 
ies in all distant clusters. The observed biweight scatter in the 
full sample au-v = 0.056 ± 0.005. The scatter expected from 
observational error is less than ~ 0.02, and we estimate that 
the intrinsic scatter is « 0.052. Averaging the three clusters at 
0.17 < z < 0.40, the two clusters at 0.54 < z < 0.59, and the 
two clusters at 0.83 < z < 0.84, we find no evidence for evolu- 
tion with redshift: the scatter in each redshift bin is consistent 
with the value for the full sample. The value for the scatter 
that we find is significantly lower than results from previous 
studies (e.g., Holden et al. 2004; Blakeslee et al. 2006). This 
is most likely due to a dependence of the scatter on galaxy 
mass. We also note that the color-mass relation is expected to 
have a smaller scatter than the color-luminosity relation even 
within the same sample of galaxies, if the scatter is due to age 
variations (see van Dokkum et al. 1998b). 
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Fig. 4. — Evolution of the zeropoint of the color-mass relation. 
Open symbols are three clusters from Mcintosh et al. (2005), for 
which no direct mass measurements are available. Note that there 
may be a small systematic offset between the Coma cluster at z = 
0.024 and the other clusters due to differences in methodology. The 
solid line is the best-fit linear function to the data; broken lines indi- 
cate the ±lcr uncertainty in the slope of the relation. 

4. FITTING 

4. 1 . Constraints on the IMF 

As discussed in §|2]luminosity and color evolution each de- 
pend on the IMF and on the age of the stellar population, but 
the age-dependence drops out when comparing the amount of 
luminosity evolution to the amount of color evolution. Figure 
|5] shows the measured evolution in \og{M/LB) as a function 
of the evolution in U -V . This Figure is the equivalent of 
Fig-Ec). The values for /S.\og{M / Lb) are taken directly from 
vv07, and the values for A(t/-y) are listed in Table 1. 

As expected, there is a clear relation, with galaxies becom- 
ing both bluer and more luminous at earlier times. The solid 
line is a linear fit to the data (taking the errors in both param- 
eters into account) of the form 

Alog(M/LB) = 2.8!j]jx A(f/-V) (15) 

The residuals from this fit are consistent with expectations 
from the uncertainties in the datapoints. The solid and dashed 
red lines in Fig. [15] are predictions for a Salpeter-like** IMF 
with x= 1.3 and two different metallicities. Remarkably, the 
observed relation is much steeper than expected from a stan- 
dard IMF, even for Solar metallicity. For a given color evolu- 
tion the luminosity evolves faster than expected, indicating 
a top-heavy IMF with a relatively large fraction of rapidly 
evolving massive stars. 

The logarithmic slope of the IMF follows directly from Eqs. 
|5] |6] and [15] We find x = -\At\ \ for super-Solar metallicity 

* The observations constrain the IMF in the mass range around 1 Mq , and 
at those masses differences between the Salpeter (1955), Kioupa (2001), and 
Chabrier (2003) IMFs are very smaU. In vv07 it is shown explicitly that these 
various IMFs all predict very similar M/L evolution over the relevant range 
of ages; hence the term "Salpeter-like" or "standai'd" to denote IMFs with 
slope j:f»1.3atm>l Mq . 



and X = -0.1;I;i 1 for Solar metallicity, where the uncertainties 
reflect the 68 % confidence interval. Both values are well be- 
low the canonical Salpeter value of x = 1 .3. Negative values of 
X have a large systematic uncertainty, as they require signifi- 
cant extrapolation of the Maraston (2005) models. Therefore, 
the results can best be expressed as upper limits, particularly 
for the super-Solar metallicity model. The 90 % confidence 
upper limits are x < 0. 1 for the super-Solar model and x < 0.9 
for Solar metallicity. The Salpeter value can be ruled out at 
> 98 % confidence. 




Mil -V) 

Fig. 5. — Comparison of luminosity evolution to color evolution, 
for massive galaxies in clusters at 0.02 < z < 0.83. The point at 
(0,0) is the Coma cluster. Red lines show the expected relation for 
IMFs with a slope ofx= 1.3 near 1 Mq . The solid line is a Maraston 
(2005) model of super-Solar metallicity; the broken line is the best- 
fitting powerlaw to this model. The dashed line represents a Maras- 
ton (2005) model with Solar metallicity. The solid black line shows 
the best-fitting relation, with the Icr uncertainty in grey. The evo- 
lution of the M/L ratio is faster than expected for "standard" IMFs, 
and is consistent with expectations for a top-heavy IMF. Arrows in- 
dicate the corrections that would have to be applied to account for 
redshift-dependent dust effects. 



4.2. Joint Constraints on the IMF and Formation Epoch 

Directl y fitt ing /WogiM/Ls) as a function of /\{U -V), as 
done in § 14. 11 has several advantages: the same galaxies are 
used to measure the relevant parameters, limiting selection ef- 
fects; redshift-dependent selection effects such as progenitor 
bias cancel; and the IMF is the only free parameter (at fixed 
metallicity), as the formation epoch of the stars Ziorm also can- 
cels. However, constraining Zform better is of great interest in 
its own right, and allows us to associate a particular time in 
the history of the universe with the IMF result. Here we fit 
the redshift evolution of the M /Lb ratio and the U -V color 
simultaneously, and derive joint constraints on the slope of 
the IMF X and the star formation epoch Zfonn- The goals are 
to determine the luminosity-weighted star formation epoch of 
massive cluster galaxies in a self-consistent way and to verify 
the results of § 14.11 Although the additional parameter im- 
plies more freedom in the fits this is compensated by the fact 
that more data can be used, as we are no longer restricted to 
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Fig. 6. — Redshift evolution of the rest-frame M/Lb ratio of massive cluster galaxies from vv07 (a), and of the rest-frame U — V color (b). 
Corrections of -0.05z and -0.023z have been applied to the data in panels (a) and (b) respectively, to account for (mild) progenitor bias. The 
lines show models with different formation redshifts zfoim and IMF slopes x. The red model has a standard IMF and zfoim = 2, the blue model 
has a standard IMF and Zfo,„, = 6, and the black model has a flat IMF and Zfonn = 6. Only the black model is a good fit to both datasets. 



the sample of eight clusters which have both M /L and color 
information. 

The data are shown in Fig. |6] Panel (a) shows offsets in 
/S.\og{M / Lb) from vv07, and panel (b) shows the offsets in 
U -V determined in the present study. The dataset is more 
extensive than used in the analysis of § 14.11 The M/L sample 
includes data from the SDSS and for seven additional distant 
clusters. The color sample includes the three clusters from 
Mcintosh et al. (2005), which help constrain the low red- 
shift end. Small corrections for progenitor bias have been ap- 
plied, following vv07: -0.05z for Alog(M/LB) and -0.023 
for /S.{U-V). The color correction was chosen to be con- 
sistent with the correc tion for luminosity evolution and with 
Eq. [15] As shown in § 15.11 setting the progenitor bias to zero 
leads to slightly higher formation redshifts but has virtually 
no impact on the IMF constraints. 

The data are fit by creating models over a grid of x and ffom, 
with X the logarithmic slope of the IMF and fform the mean lu- 
minosity weighted formation time of the stars. For each value 
of X corresponding values of kb and ku-v are determined us- 
ing Eqs.[3]and|4l Next, Eqs.[T]and|2]are used to determine the 
expected evolution with cosmic time for each combination of 
X and /form- The values of the fits are then calculated by 
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The fit has four free parameters: x, ffo„n (or equivalently Zform), 
and the normalizations to the M/L and color data. Figure |7] 
shows the 68 %, 95 %, and 99 % confidence intervals in the 
X, fform plane, as determined from the Ax^ values appropriate 
for two interesting parameters. Solid contours are for super- 




Solar metallicity ([Fe/H] = 0.35) Maraston (2005) models and 
broken contours are for Solar metallicity models. Flat IMFs 
and high formation redshifts are clearly preferred, for both 
metallicities. 
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Fig. 7. — Results from simultaneous fits to the redshift evolution 
of logM/Lfi and U -V color. Contours indicate 68%, 95%, and 
99% confidence limits, as determined from the Ax"(?form,Jt") distri- 
bution. Solid contours are for Maraston et al. (2005) models with 
[Fe/H] = 0.35, and broken contours are for [Fe/H] = 0. Early star for- 
mation and top-heavy IMFs are clearly preferred. Red, blue, and 
black dots indicate the locations of the models shown in Fig. [6] 
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The red, blue, and black dots indicate examples of models 
with different parameters. The fits to the luminosity and color 
evolution for these three models are shown in Fig.|6] and illus- 
trate how the data simultaneously constrain x and ffom- The 
red model has a standard Salpeter IMF and a low formation 
redshift Zfoim = 2. This model provides an excellent fit to the 
evolution of the M /Lb ratio, as shown in Fig. |6ja) and dis- 
cussed in vv07. However, this model fits the data in Fig.|6jb) 
poorly: the observed color evolution is much slower than pre- 
dicted by this young model. The blue model is a Salpeter 
model with a high formation redshift Zfovm = 6. This model 
fits the slow color evolution very well, but underpredicts the 
M/L evolution in Fig. |6ja). The only models that fit the data 
in both panels simultaneously have high formation redshifts 
(to fit the colors) and a flat IMF (to fit the M/L ratios). The 
black model has Zfom = 6 and x = -0.9. This model provides 
an excellent fit to the data in both panels. 

The formal best-fitting values are Zform = 7.6, x = -1.2 for 
super-Solar metallicity, and Zfo„n = 3.7, x = -0.3 for Solar 
metallicity. The best-fit values for the IMF slope are in very 
good agreement with those derived in § 14.11 As discussed 
earlier negative values for x have large systematic uncertain- 
ties, and may not be very meaningful. The super-Solar so- 
lutions are therefore somewhat difficult to interpret. Formal 
68 % confidence intervals for the Solar metallicity solution are 
-1.0 < X < 0.1 and 2.9 < Zform < 6.0. These results are con- 
sistent with the analysis of the M/L evolution alone in vv07, 
where it was noted that the best-fitting Zfmm ~ 4 for top-heavy 
IMFs with x = 0. 

5. CAVEATS AND UNCERTAINTIES 

Although straightforward in principle, the IMF test dis- 
cussed in this paper has several associated uncertainties, some 
having to do with the data and some with the models that are 
used to fit the data. 

5.1. Effects of Sample Selection and Progenitor Bias 

Every cluster for which the required data were available in 
the literature and/or the HST archive was included in the anal- 
ysis, and as they come from a wide range of projects by many 
different research groups this may have introduced system- 
atic errors. In particular, it may be that the sample of clusters 
with FP measurements is somehow different from the sample 
of clusters with color measurements. This potential bias is 
explicitly addressed in § 14.11 where the analysis was limited 
to clusters in common between the two samples. As a further 
test, the analysis in § |4.2| was repeated after removing the three 
clusters with z > 0.83 from the sample. These clusters have 
a large weight in the M/L analysis as they are at the highest 
redshifts, and they have no counterparts in the color sample. 
The effects are small, but not negligible: for the super-Solar 
model the best-fitting IMF slope changes from -1.2 to -1.0 
and the best fitting formation redshift changes from 7.6 to 5.9 
when the highest redshift clusters are removed. 

Redshift-dependent selection effects such as progenitor bias 
are not a major concern in this particular study. If we system- 
atically miss the youngest progenitors of today's clusters, or 
the youngest progenitors of today's early-type galaxies within 
clusters, the formation redshifts that we derive may be too 
high but the IMF results should be robust. The IMF is con- 
strained by the ratio of the amount of luminosity evolution to 
the amount of color evolution, and progenitor bias would (to 
first order) equally affect the colors and luminosities. This 



can be tested explicitly by varying the progenitor bias correc- 
tion to the M/L ratios and colors, and repeating the analysis. 
Increasing progenitor bias by a factor of two gives the same 
IMF slope (-1.2) as the model adopted in § 14.21 and a lower 
formation redshift (4.8 rather than 7.6). Setting the progeni- 
tor bias to zero again gives the same IMF slope (-1.2), and a 
formal best-fitting formation redshift Zform = oo. 

5.2. Systematic Errors in the Photometry 

As discussed in § l3.3.2l the low cluster-to-cluster scatter in 
the color evolution implies that the errors in the photometry 
are more likely overestimated than underestimated. Never- 
theless, absolute photometry is challenging, and it is difficult 
to rule out subtle systematics in the various corrections and 
transformations that have been applied to the data. Increas- 
ing the errorbars on the colors by 50 % (much more than is 
plausible - see § 13. 3. 2b obviously loosens the constraints on 
the IMF, but not by a large amount. Repeating the analysis of 
§ 14.1 [ changes the upper limits on the IMF slope from x < 0.1 
(for super-Solar metallicity) and x < 0.9 (for Solar metallic- 
ity) to X < 0.4 and x < 1 . 1 respectively. 

Aside from general concern about the assumed errors, a 
specific worry is the tie between the low redshift data and the 
(homogeneous) synthetic photometric system that is used for 
the distant clusters. Absolute photometry in the U band is par- 
ticularly difficult: the bandpass is strongly influenced by the 
detector and the atmosphere, and the spectral energy distribu- 
tions of old galaxies have a very steep slope around 3800 A. 
We dealt with this issue in § l3.3.2l bv adding a 3 % systematic 
uncertainty in the Coma offset. A more conservative approach 
is to simply discard Coma altogether. Interestingly, remov- 
ing Coma has the effect of tightening the limits on the IMF 
slope. As an example, for Solar metallicity the 90 % upper 
limit changes from 0.9 to 0.3. The reason for this is readily 
apparent from Fig.|5] Coma is the point with the largest devia- 
tion from the best-fitting line, and the relation becomes much 
steeper when it is removed.^ We note that the mean M/Lg 
ratio of Coma galaxies may also be somewhat anomalous, as 
it deviates from the best-fitting relation in Fig. [6^. Adopting 
the M /Lb ratio from this fit rather than the actual value brings 
the cluster in close agreement with the solid line in Fig.|5] 

5.3. Interpretation of the Fundamental Plane and M/L 
Ratios 

The evolution of the mean M/L ratio, shown in Fig. |6ja), 
is derived from the evolution of the fundamental plane (FP) 
relation (Djorgovski & Davis 1987). As discussed in detail in 
several papers (e.g., Franx 1993; van Dokkum & Franx 1996; 
Treu et al. 2001) the empirical FP relation can be rewritten 
as a relation between M/L ratio and mass, if it is assumed 
that M oc cr^r^ (with a the stellar velocity dispersion and r^ 
the effective radius) and early-type galaxies are a homologous 
family. If these assumptions are valid the observed evolution 
of the FP should track the evolution of the zeropoint of the 
underlying M/L - mass relation. However, this may not be 
the case if galaxies undergo significant structural changes over 
time, due to merging or other processes (see, e.g., Almeida, 
Baugh, & Lacey 2007). 

Strong gravitational lenses provide an independent check 
on the FP interpretation. Current results indicate that the evo- 

^ Abell 22 1 8 is the point with the largest deviation in the opposite sense to 
Coma; arbitrarily removing this cluster (while retaining Coma) changes the 
90 % upper limit from 0.9 to 1.3 for Solar metallicity. 



van Dokkum 



11 



lution of the M /L ratio as derived from strong lenses is con- 
sistent with that derived for field early-type galaxies (Rusin 
& Kochanek 2005), although the systematic uncertainties in 
this comparison are still fairly substantial (see, e.g., Treu & 
Koopmans 2004; van der Wei et al. 2005; Rusin & Kochanek 
2005). Recently van der Marel & van Dokkum (2007) used 
spatially-resolved photometric and dynamical observations of 
cluster early-type galaxies at z w 0.5 to directly test the as- 
sumption that the FP tracks M /L evolution. M /L ratios de- 
rived from modeling the spatially-resolved data were directly 
compared to those determined from the FR Interestingly, the 
validity of the FP for determining M/L ratios appears to de- 
pend on the mass: for galaxies with masses > 10" or ve- 
locity dispersions > 200 km s"' the M/L ratios are consistent 
with each other, but for low mass galaxies the FP appears to 
systematically underestimate the M/L ratio. Although these 
results suggest that the traditional interpretation of FP evolu- 
tion is substantially correct at the high mass end, further tests 
would be valuable, particularly at z > 0.5. 

Another complication is that the measured M/L ratio is not 
identical to the stellar M/L ratio calculated in stellar popu- 
lation synthesis models. In the models is the sum of the 
mass of all living stars and the mass of stellar remnants (black 
holes, neutron stars, and white dwarfs). By contrast, the mea- 
sured mass includes all possible contributions, including dark 
matter and gas. Although the amount of dark matter in ellip- 
tical galaxies is still somewhat uncertain (e.g., Romanowsky 
et al. 2003; Mamon & Lokas 2005; Cappellari et al. 2006), 
its contribution within the effective radius is probably small 
(Mamon & Lokas 2005; Koopmans et al. 2006). The contri- 
bution of gas to the total mass is also uncertain. One would 
expect substantial amounts of gas and dust from stellar mass 
loss (see, e.g., Goudfrooij et al. 1994), qualitatively consis- 
tent with Spitzer observations (Temi, Brighenti, & Mathews 
2007). However, the data appear to be inconsistent with sim- 
ple expectations, and Temi et al. (2007) suggest that the gas is 
transported outward, possibly due to AGN activity. Further- 
more, in galaxy clusters the galaxies move through very hot, 
diffuse gas at great speed and it seems likely that any cold 
gas coming from stellar winds is fairly efficiently stripped or 
heated. 

Although dark matter and gas could contribute to the mea- 
sured M/L ratio, these contributions will to large extent can- 
cel when comparing galaxies over a range of redshifts. In 
the Maraston (2005) models the amount of mass loss over the 
relevant age range from 5Gyr to 12Gyr is only « 2% for 
a Salpeter (1955) IMF and w 3 % for a Kroupa (2001) IMF 
Even if half of this mass was retained rather than lost to the in- 
tracluster medium, the M/L evolution would change by only 
1-2%. 

5.4. Uncertainties in Stellar Population Synthesis Models 

Uncertainties in the stellar population synthesis models are 
perhaps the largest source of error, and at the same time they 
are notoriously difficult to quantify. The fundamental prob- 
lem is that the main direct tests and calibrations of the models 
are offered by Galactic open clusters and globular clusters, but 
that these may not be representative for the integrated stellar 
populations of massive early-type galaxies. 

Recently the confidence in these models was somewhat 
shaken by the large differences that exist between predictions 
of Bruzual & Chariot (2003) and Maraston (2005). As dis- 
cussed in detail by Maraston (2005) and others (e.g., van der 
Wei et al. 2006b; Maraston et al. 2006), the predicted rest- 



frame optical to near-infrared colors of simple stellar popu- 
lations can deviate by > 0.5 magnitudes for identical input 
metallicities and ages. The differences are in large part due to 
a different numerical implementation of the thermally pulsat- 
ing asymptotic giant branch (TP-AGB) phase of stellar evolu- 
tion, and are therefore largest for ages 0.5-2 Gyr and colors 
which include a near-infrared band (such as B-K).^ 
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Fig. 8. — Color-color diagram for galaxies in the Coma cluster. 
Colors were obtained from Eisenhardt et al. (2007), for galaxies in 
common with the J0rgensen et al. (1995a,b) FP sample. Solid sym- 
bols indicate galaxies with masses > lO" Mq. The lines attached to 
the points show the effects of changing the photometry aperture from 
10"3 to 6."2. Colored lines show model predictions from Maraston 
(2005) for a 12 Gyr model with metallicity -0.33 < [Fe/H] < +0.35. 
Different colors indicate different IMF slopes: x= 13 (red), x = 0.7 
(green), and x = (blue). The models fit the observed colors well. 

Our analysis avoids the region of parameters space where 
the differences between the models are most pronounced 
(ages of 0.5-2 Gyr and rest-frame near-infrared passbands). 
The models are better calibrated (and in much better agree- 
ment) for ages > 3 Gyr and rest-frame optical passbands, 
largely because the contribution of giants to the integrated 
light is significantly smaller. However, subtle differences be- 
tween the models also exist at ages > 3 Gyr; as an example, 
the B-K colors of the Maraston (2005) models are slightly 
bluer than those of Bruzual & Chariot (2003) (see van der 
Wei et al. 2006b). 

To get some handle on the effects of the choice of model 
the derivation of Eq. |5] was repeated for a Solar metallicity 
Bruzual & Chariot (2003) model. By comparing the color 
evolution to the M/L evolution for two different IMF slopes 
(x = 1.35 and x = 0.35) we find kb/ku-v ~ 5.0- 1.2x. The re- 
lation is less steep than that of the Solar metallicity Maraston 
(2005) model, and is close to Maraston's super-Solar metal- 
licity model (the solid line in Fig.|5]l. For a Salpeter IMF and 
Solar metallicity the Bruzual & Chariot (2003) model pro- 
vides an even worse fit to the data than the Maraston (2005) 

* It should be noted that the treatment of TP-AGB stars is revised in the 
most recent incarnation of the Bruzual & Chariot models, which brings them 
more in line with the Maraston (2005) models (see Bruzual 2007). 
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model, and the measured slope of the M/L - color relation im- 
plies X ~ -0.4 (compared to x -0. 1 for the Solar metallicity 
Maraston model). We infer that our conclusions are robust in 
the context of presently popular models, but note that there 
are significant variations in the model predictions even in the 
rest-frame optical. 

Finally, an important check on the entire enterprise is 
whether the models can match the absolute colors of the 
galaxies. So far, we have not used the absolute colors in any 
way and only considered the rate of color evolution, leaving 
the normalization a free parameter. The absolute colors are 
difficult to interpret, as they suffer from strong degeneracies 
between age, metallicity, and dust content. Nevertheless, the 
models should be able to reproduce the observed colors of 
massive galaxies in Coma for plausible combinations of these 
parameters. Figure [8] shows the location of galaxies in the 
Coma cluster inacolor-color diagram ofV-/? vs. U-V, along 
with model predictions of Maraston (2005). All models have 
an age of 12 Gyr, corresponding to Zfoim ~ 4. The models have 
approximately the correct U -V colors at z = for Zform ~ 4, 
and are therefore self-consistent. The fact that the agreement 
is particularly good for models with Solar metallicity and x = 
is probably coincidence, given the uncertainties. 

5.5. Blue Stars in Old Stellar Populations 

As is well known old stellar populations can include very 
luminous hot stars. These stars are rare, but due to their high 
luminosity they can have a significant effect on the integrated 
luminosity of a stellar population, particularly in blue pass- 
bands. These hot stars fall in three broad categories: blue 
horizontal branch stars (e.g., Zinn, Newell, & Gibson 1972; 
Rich et al. 1997), extreme horizontal branch stars (e.g.. Dor- 
man, Rood, & O'Connell 1993), and blue stragglers (BSs) 
(e.g., Sandage 1953; Bailyn 1995). Blue horizontal branch 
stars are generally included in current stellar population syn- 
thesis models. Extreme horizontal branch stars are thought 
to be responsible for the UV-upturn in elliptical galaxies at 
A < 2000 A (Burstein et al. 1988), and should not affect the 
U-V colors (see, e.g., Yi et al. 2003, 2005). 

Blue stragglers are not included in stellar population syn- 
thesis models, probably because conditions for their forma- 
tion are still not well understood. However, they can have a 
significant effect on the integrated near-UV and optical colors 
of a stellar population. Xin & Deng (2005) find that the con- 
tribution of BSs to the integrated B - y color of old open clus- 
ters is typically ~ 0.2 magnitudes. Their contribution to the 
U-V color has not been measured directly, but is expected to 
be ^ 0.3 magnitudes or so. In practice, BSs are not included 
when fitting isochrones to the stellar populations of open clus- 
ters or globular clusters, but we do not have that luxury for 
unresolved stellar populations. 

The effect of a BS population such as that in M67 would be 
larger than the IMF effects discussed in this paper, and there is 
no a priori reason to suppose that BSs do not exist in elliptical 
galaxies (see, e.g., Stryker 1993). Nevertheless, they proba- 
bly do not have a very large effect on our results. First, the 
relevant question is not how much BSs contribute to the inte- 
grated colors, but to what extent they influence their evolution 
from ^ 5 Gyr (the age at z = 0.83 for Zfoim 4) to ~ 12 Gyr 
(the age at z = 0). To have a significant impact on our results 
BSs would have to have been absent in elliptical galaxies for 
five or more Gyr and then have "turned on". This seems rather 
contrived, particularly since the turn-off mass changes by only 

0.4 M0 over this age range. If anything, one might expect 



BSs to be slightly more prevalent at younger ages. One of 
the preferred models for their formation^ is through the coa- 
lescence of binary stars (Mateo et al. 1990), and if binaries 
start out with a flat distribution of orbital parameters the rate 
at which some of them turn into BSs will gradually decrease. 
Coupled with the fact that BS lifetimes are short compared to 
the main sequence stars, this could lead to a gradual decline 
in the blue straggler population. More to the point, Xin & 
Deng (2005) have determined the contribution of BSs to the 
integrated light of open clusters with a large range of ages. 
They find no correlation between the contribution of BSs to 
the integrated colors and the age of the cluster. As a result, 
the integrated colors of clusters with ages varying from 1 Gyr 
to 8 Gyr are reasonably well described by a single stellar pop- 
ulation synthesis model with a constant offset. 

Another reason why the effects of BSs are probably fairly 
mild is that it is difficult to hide them completely in nearby 
elliptical galaxies. If BSs are as prevalent in elliptical galax- 
ies as they are in open clusters, they would lead to relatively 
blue near-UV colors and strong Balmer absorption lines. Pop- 
ulation synthesis models would (correctly) infer the presence 
of A and F main sequence stars, and (incorrectly) attribute 
them to a secondary young stellar population, and/or assign 
an overall young luminosity-weighted age to the galaxy. As 
pointed out by Schiavon, Caldwell, & Rose (2004), if a galaxy 
had the same stellar population as M67 it would be classified 
as "Eh-A" (Dressier & Gunn 1983) to signify the presence of a 
post-starburst component in addition to an older component. 
Xin & Deng (2005) quantify these effects and conclude that 
the ages of unresolved stellar populations will typically be un- 
derestimated by a factor of 2 if blue stragglers are present 
in similar numbers as in open clusters. Turning to many years 
of population synthesis modeling of massive elliptical galax- 
ies in clusters, such dramatic effects can be safely ruled out. 
As an example, Thomas et al. (2005) infer ages of ^ 12 Gyr 
(i.e., close to the age of the Universe) for cluster galaxies with 
M > 10'' Mq, not 6 Gyr as one might expect from a BS- 
rich stellar population. Qualitatively, the red colors and weak 
Balmer lines of massive cluster ellipticals place strong limits 
on the relevance of BSs in these galaxies. 

Finally, we note that Trager et al. (2000b) argue that BSs do 
not contribute significantly to the integrated light even in el- 
liptical galaxies that do have enhanced Balmer lines and signs 
of young populations (typically relatively low mass field ellip- 
ticals). Rather than discuss their arguments, we note that the 
finding that the derived age of ellipticals appears to correlate 
with the presence of morphological fine structure (Schweizer 
& Seitzer 1992) provides additional support for the notion that 
late star formation, rather than a BS population, is generally 
responsible for the young appearance of some nearby ellipti- 
cals. We are left to wonder why BSs seem to be deficient in 
elliptical galaxies, as compared to old open clusters. No spec- 
ulation will be offered here, as this question is well outside 
the scope of the present study. 

5.6. Complex Evolution and Dust 

As argued in § 12.31 the population of massive early-type 
galaxies in clusters is not thought to have experienced sig- 
nificant star formation at redshifts <C 1 . Young stellar popula- 
tions have been seen in some field ellipticals (e.g., Trager et al. 
2000a; Thomas et al. 2005), but a large body of observational 

^ At least in old open clusters; other mechanisms may dominate at young 
ages and in other environments. 
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evidence supports the notion that massive cluster galaxies are 
remarkably homogeneous (e.g., Bower et al. 1992a; Stanford 
et al. 1995, 1998; Ellis et al. 1997; Kodama et al. 1998; van 
Dokkum et al. 2000; Holden et al. 2004; Thomas et al. 2005; 
Tran et al. 2007). This apparent homogeneity may be mis- 
leading and mask more complex evolution (van Dokkum & 
Franx 2001), but this is not likely for the most massive galax- 
ies (Holden et al. 2007). It may well be that cluster early- 
types experienced more complex evolution at z > 1, in par- 
ticular during and before virialisation of the clusters in which 
they now live, but this does not affect the analysis. The pa- 
rameter Zform does not refer to a single star formation event, 
but reflects the luminosity-weighted mean formation epoch of 
the stars. As shown explicitly in van Dokkum et al. (1998b) 
color and luminosity evolution are nearly identical for stel- 
lar populations with different star formation histories but the 
same luminosity-weighted age, as long as star formation ter- 
minated at least ~ 1 Gyr prior to the epoch of observation. 

Dust needs to be treated separately, as significant amounts 
of gas and dust are expected to exist in early-type galaxie s as a 
result of winds emanating from massive stars (see also § l5.3l l. 
Cluster galaxies at z ~ 0.8 may still have retained a significant 
fraction of the dust expelled by massive stars, whereas it may 
have been stripped, blown out, or obliterated at later times 
(see, e.g., Goudfrooij et al. 1994; Temi et al. 2007). Quanti- 
fying these effects is very difficult, as they not only depend on 
the amount of mass loss but also on the detailed processes and 
timescales for cooling and heating of the dust. Qualitatively, 
one might expect higher reddening at z 0.8 and therefore a 
lower apparent color evolution from z ~ 0.8 to z ^ 0. The true 
color evolution would then be stronger, and the data might be 
consistent with a normal IMF in combination with a relatively 
low stellar age. 

However, dust would not only affect the colors but also the 
luminosities. The relation between reddening and extinction 
in elliptical galaxies is not well known, but is probably not 
very different from that in the Milky Way (e.g., Goudfrooij 
et al. 1994). For reference, arrows in Fig.|5]show the effects 
of Milky Way-like dust and of the (relatively grey) Calzetti 
(1997) extinction curve. Calzetti-like dust would have essen- 
tially no effect on the analysis, as the dust vector is nearly 
parallel to the observed relation between M /L ratio and color. 
Milky Way-like dust would have an effect, but would lead to 
unrealistically low ages for the galaxies. To change the ob- 
served relation to the one expected for a standard IMF the 
z = 0.8 galaxies would have to be reddened by ^ 0.3 mag in 
U -V , corresponding to a Z?-band extinction of ~ 0.6 mag. 
The implied luminosity evolution from z = 0.8 to the present 
would then be 1.7 magnitudes (instead of 1.1 magnitudes; 
see vv07). This in turn would mean that massive cluster galax- 
ies formed their stars at z 1.3, which is ruled out by many 
independent observations (e.g.. Kelson et al. 2001; Thomas 
et al. 2005; Kodama et al. 2007). 

5.7. Summary of Fitting Results and Associated 
Uncertainties 

The main result from the fits in §|4] is that stellar popula- 
tion synthesis models with a standard, Salpeter-like IMF in 
the region around 1 Mq are not able to simultaneously fit the 
luminosity and color evolution of massive cluster galaxies. 
By contrast, models with flat IMFs and high stellar formation 
redshifts provide excellent fits. The slow color evolution mea- 
sured here is not easily explained by invoking sample selec- 
tion effects, photometric errors, a blue straggler population. 



or differential dust extinction. 

The formal limits on the IMF slope and formation redshift 
are obviously very uncertain, as they do not take any of the 
effects discussed above into account. Furthermore, negative 
IMF slopes are outside of the parameter space covered by the 
models, which makes the formal Super-solar metallicity so- 
lutions somewhat suspect. In what follows the formal Solar 
metallicity solutions (x = -O.3;!;07 and Zform = will be 

used as a "default" model, but it should be kept in mind that 
the errorbars almost certainly underestimate the true uncer- 
tainty. 

6. COMPARISON TO PREVIOUS RESULTS 

Many previous studies have determined or constrained the 
properties of the stellar populations of early-type galaxies. 
Although direct comparisons are difficult because of differ- 
ences in sample selection and methodology, it is important to 
verify that our results are broadly consistent with the large 
body of literature that exists on this topic. 

6.1. Other studies of the Evolution ofM/L Ratios and 
Colors 

As discussed in detail in vv07, the values shown in Fig.|6t 
are mostly determined from literature data, and therefore "au- 
tomatically" consistent with a large body of previous work. 
The only exception is J0rgensen et al. (2006), who find sub- 
stantially slower evolution in the first version of their pub- 
lished paper. However, this was due to a conversion error, 
and the vv07 results are fully consistent with the Erratum to 
J0rgensen et al. (2006) published in 2007. 

The evolution of the rest-frame U -V color cannot be di- 
rectly compared to previous work, as most studies choose to 
compare the observed colors to redshifted model predictions 
rather than convert the data to the rest-frame. Furthermore, 
color evolution has not been studied previously for mass- 
limited samples of cluster galaxies. Nevertheless, it is inter- 
esting to compare the constraints on the stellar ages that are 
reported in the literature, as they should reflect the measured 
rate of color evolution. 

Most studies of the color evolution of early-type galaxies 
in clusters find very old ages, in qualitative agreement with 
the results found here (e.g., Stanford et al. 1995, 1998; Ko- 
dama et al. 1998; Koo et al. 2005). In particular, no stud- 
ies have reported formation redshifts as low as ~ 2 for the 
most massive galaxies in clusters, which is the value im- 
plied by the evolution of the M /L ratios for a standard IMF. 
The most comprehensive study to date is by Holden et al. 
(2004), who combined data from Stanford et al. (1998) with 
new measurements to obtain a combined sample of 3 1 clus- 
ters at 0.3 < z < 1.3 with multi-band photometry. They find 
Zform = 5^1^ from Solar metallicity Bruzual & Chariot (2003) 
models, in good agreement with our results. It should be noted 
that the Holden et al. (2004) study may suffer from some of 
the same systematic effects as the work described here; for ex- 
ample, an emerging blue straggler population would influence 
all studies of color evolution. 

6.2. Evolution of Balmer Absorption Lines 

Kelson et al. (2001) study the evolution of the Balmer ab- 
sorption line strengths of early-type galaxies in clusters at 
0.06 < z < 0.83, and their data offer an independent check on 
the results derived here. The Balmer lines should behave like 
the U -V colors, in the sense that they are fairly insensitive 
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to the IMF and their rate of evolution is mostly determined by 
the age of the stellar population. Importantly, the sources of 
systematic error (both in the data and in the models) are very 
different from those that affect the colors (see, e.g.. Kelson 
et al. 2006). 

The Kelson et al. (2001) data are shown in Fig. |9] The 
data are compared to two Solar metallicity Bruzual & Chariot 
(2003) models: a model with a standard IMF and a young 
age, and a model with a top-heavy IMF and an old age. As 
discussed in vv07 and in § 14. 21 these are the only two classes 
of models which provide good fits to the observed evolution 
of the M /L ratio. The young model is clearly inconsistent 
with the data, and can be rejected with high confidence. By 
contrast, the top-heavy/old model fits the data very well. This 
model is the formal best fit to the M /L and color evolution (as 
discussed in § 15. 7b . but other models with formation redshifts 
in the range 3-6 also fit well. 

The evolution of the Balmer lines thus provides strong in- 
dependent support for the analysis in §|4] We note that the 
inferred ages are consistent with the analysis in Kelson et al. 
(2001): they give 95 % confidence lower limits to Zform of 2.4 
and 2.9 for two different stellar population synthesis models. 
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Fig. 9. — Evolution of the Balmer absorption line strengths of early- 
type galaxies in clusters, taken from Kelson et al. (2001). The lines 
show the two classes of models that fit the evolution of the M/L 
ratio of early-type galaxies: young models with a Salpeter IMF and 
old models with a top-heavy IMF. Only the old/top-heavy model fits 
the linestrength data, providing independent support for our results. 



6.3. Evolution of Field Early-Type Galaxies 

Field early-type galaxies probably evolve in more com- 
plex ways than their counterparts in clusters. Galaxy-galaxy 
mergers are expected to be rare in virialized clusters (e.g., 
Makino & Hut 1997) but could be common in groups (e.g., 
van Dokkum 2005; Bell et al. 2006), and the same is prob- 
ably true for late star formation (e.g., Trager et al. 2000a). 
Field samples may therefore not be suitable for measuring the 
subtle effects discussed in this paper Nevertheless, it is inter- 



esting to note that van der Wei et al. (2006a) find that Bruzual 
& Chariot (2003) models with a standard IMF do not fit the 
observed relation between B-K color and M /Lb ratio of field 
early-type galaxies at < z < 1 . Models with top-heavy IMFs 
fit better, although Maraston (2005) models with a standard 
IMF also provide good fits. 

6.4. "Red and Dead" Galaxies at High Redshift 

Recent studies of faint /T-selected samples have identified 
a population of apparently "red and dead" galaxies at z « 2.5 
(Franx et al. 2003; Labbe et al. 2005; Daddi et al. 2005; 
Kriek et al. 2006). Deep rest-frame optical spectra demon- 
strate strong Balmer breaks and no line emission in a sub- 
stantial fraction of massive galaxies at these redshifts, imply- 
ing that star formation in these objects mostly terminated at 
z « 3 or earlier (Kriek et al. 2006). These objects are found 
in blind surveys of relatively small fields, and it seems likely 
that many galaxies in (proto-)clusters terminated star forma- 
tion even earlier, and/or that the fraction of (nearly) passive 
galaxies is higher in those environments (see, e.g., Thomas 
et al. 2005; Steidel et al. 2005; Quadri et al. 2007; Ko- 
dama et al. 2007). Assuming that these galaxies are not 
"rejuvenated" at later times (which is entirely possible), the 
presence of substantial numbers of apparently old galaxies at 
this redshift places an interesting constraint on the luminosity- 
weighted ages of massive early-type galaxies. 

For any star formation history the time of last star forma- 
tion will be more recent than the luminosity-weighted mean 
star formation epoch. After a (5-function, the simplest star 
formation history is a top-hat with a constant star forma- 
tion rate from t = fstart to f = fstop- Such models may actu- 
ally be fairly realistic (as compared to exponentially declin- 
ing models, for instance), as they superficially resemble the 
star formation histories of massive galaxies in hydrodynam- 
ical simulations (Nagamine et al. 2005). As shown in van 
Dokkum et al. (1998b), there is a simple relation between 
the luminosity- weighted age f-fform and fstop in these models: 

t - fform ~ V(f-f,start)(f- fstop). Assuming that Zstop ~ 3 (ex- 
trapolating from the Kriek et al. results for red field galaxies), 
the luminosity- weighted mean formation epoch Zform ^3.7 for 
fstop - fstaii > 1 Gyr. 

Luminosity-weighted formation redshifts of ^ 4 are en- 
tirely consistent with the old/top-heavy model that is implied 
by the analysis in §|4] Formation redshifts of 2-2.5, which 
are implied by the M/L ratio evolution for Salpeter-like IMFs 
(see vv07 and §|4|i may be difficult to reconcile with the pres- 
ence of apparently old field galaxies at z ~ 2.5. We note, how- 
ever, that this type of evidence should be treated with caution, 
as the old high redshift objects could constitute only a small 
fraction of the progenitors of today's early-type galaxies. 

6.5. Linestrengths of Nearby Elliptical Galaxies 

Many studies have constrained the properties of the stel- 
lar populations of nearby early-type galaxies using absorption 
line strengths (e.g., Peletier 1989; Worthey et al. 1992; Trager 
et al. 2000a; Thomas et al. 2005). Linestrengths are generally 
more helpful than colors, as they suffer slightly less from the 
strong degeneracies that exist between (particularly) age and 
metallicity (e.g., Worthey 1994). One of the difficulties in 
linestrength studies is that the abundance ratios of early-type 
galaxies do not match Solar models (Peletier 1989), which in 
itself may be indirect evidence for a top-heavy IMF at early 
times (Worthey et al. 1992). Thomas et al. (2005) argue 
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that models which explicitly include non-Solar abundance ra- 
tios can date the star formation epoch of early-type galaxies. 
These models indicate that cluster early-type galaxies formed 
their stars at z = 3-5, in good agreement with the ages derived 
here for top-heavy IMFs. 



6.6. Absolute M/L Ratios of Nearby Elliptical Galaxies 

In §|4]we were only concerned with the evolution of the de- 
rived M/L ratios, and not with their absolute values. The ab- 
solute M/L ratios of nearby elliptical galaxies provide a pow- 
erful additional constraint on the IMF, as the stellar M/L ratio 
implied by the combination of the IMF and the observed lu- 
minosity should not exceed the dynamical M/L ratio. As dis- 
cussed in § 1 the observations of Cappellari et al. (2006) are 
consistent with the stellar mass implied by a Chabrier (2003) 
IMF and rule out IMFs which are significantly steeper than 
the Salpeter value, because they have too much mass locked 
up in low mass stars. 

Perhaps surprisingly, the dynamical M/L ratios also rule 
out top-heavy IMFs of the form fitted in §|4] These IMFs 
have less low mass stars than a standard Salpeter (1955) IMF 
but many more high mass stars. As a result, the stellar mass 
function at late times is "remnant heavy", dominated as it is by 
white dwarfs, neutron stars, and blackholes. Maraston (1998) 
shows that the M/L ratio at old ages reaches a minimum for 
X ~ 1.3: for larger values the IMF is dwarf-dominated and for 
smaller values the IMF is remnant-dominated. Specifically, 
Solar metallicity Maraston (2005) models predict M/Lb = 9 
at 12 Gyr for jc = 1 .35 and M/Lb = 87 for x = 0! Such high 
M/L ratios are an order of magnitude higher than measured 
(see vanderMarel 1991; Cappellari et al. 2006; Gavazzi et al. 
2007, and many other studies), which means that IMFs with 
a constant logarithmic slope x = Q over the entire mass range 
0.1 - 100 M0 can effectively be ruled out. In the next Section 
we will explore IMFs which have a Salpeter-like slope at high 
masses and x ~ in the region around 1 Mq. Such IMFs are 
physically more plausible than IMFs with a constant slope 
and do not violate dynamical limits on the stellar M/L ratios 
of nearby galaxies (see § 17. 4t . 



7. IMPLICATIONS 
7.1. Implications for the Characteristic Mass at z ^ 4 

In the Mai-aston (2005) models described in §122] the IMF 
is parameterized as a power law with constant slope x over 
the entire mass range 0.1 - 100 Mq. The best-fitting slope of 
X ~ implies an extremely top-heavy mass function, with a 
very large number of massive stars. However, other forms 
of the IMF are also consistent with the data, as the analysis 
of §|4]is completely insensitive to the IMF at masses exceed- 
ing the turn-off mass at 5 Gyr (« 1.15Mo). A physically 
more plausible IMF would have a Chabrier (2003) form, with 
a fixed high-mass slope of x « 1.3 and a varying characteristic 
mass nic. As will be quantified later, such IMFs are not only 
physically motivated but are also entirely consistent with the 
dynamical M/L ratios measured for nearby ellipticals. 

So far, no stellar population synthesis modeling has been 
done with Chabrier IMFs with varying m^. However, we can 
use the fact that our observations probe the IMF only over a 
limited stellar mass range to derive an approximate relation 
between and the slope of the IMF as parameterized in the 
Maraston (2005) models. The Chabrier (2003) IMF has the 



form 

J A/exp [-(\ogm-\ogmcf /2a^] (m < IM©) 
^ \ A,,m--^ (m>lMo), ^ ' 

with Al = 0.158, nic = 0.079 Mq, a = 0.69, A,, = 0.0443, and 
X = 1.3, and is shown by the dashed lines in Fig. [10] As shown 
in Chabrier (2003) this IMF is very similar to the Kroupa 
(2001) IMF, and both IMFs are nearly identical to the Salpeter 
( 1955) IMF for m> IMq. The bottom panel of Fig.fTOlshows 
the derivative ii(^(logm))/(i(logm), which can be thought of 
as the local logarithmic slope of the IMF. The slope gradu- 
ally steepens until leveling off at the Salpeter value for masses 
m> I Mq . 




Fig. 10. — Different forms of the IMF (a), and their derivatives (b). 
Dotted lines indicate the Salpeter (1955) IMF, with constant logarith- 
mic slope X = 1.35. The dashed lines show the Chabrier (2003) disk 
IMF, which turns over at ~ 0.08 Mq. Solid lines show a slightly 
modified form of the Chabrier (2003) IMF, which eliminates the dis- 
continuity at m = 1 Mq and explicitly accommodates a varying m^. 



The derivative of the Chabrier (2003) IMF has a disconti- 
nuity at 1 Mq, where the log-normal part of the IMF and the 
power law part connect. This discontinuity is rather minor 
and has no practical implications, but it becomes more pro- 
nounced when considering higher values of nic- Therefore, 
we adopt an extension of the Chabrier (2003) disk IMF pa- 
rameterization which does not produce discontinuities in the 
derivative and explicitly accommodates a varying iric: 

^_ \ A/(0.5nc'Wc)~* exp [-(logm-logmc)^/2(T^] (m < HcOTc) 
\ A/,m"-' (m > Hc'Mc), 

(18) 

with A/ = 0.140, He = 25, and cr. A/,, and x identical to Eq. 
[T7](see Chabrier (2003) for an alternative parameterization of 
IMFs with a high characteristic mass). The behavior of this 
modified Chabrier IMF for nic = O.O8M0, nic = 0.4 M©, and 
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nic = 2Mq is shown in Fig. fTOl' a) and (b). For nic = 0.08 this 
IMF is nearly identical to the Chabrier (2003) disk IMF. 

The observations discussed in this paper are sensitive to the 
IMF in a limited stellar mass range near 1 Mq . The B-band 
luminosity-weighted mean stellar mass of a stellar population 
is typically slightly smaller than the turn-off mass, and ranges 
from I.O-O.8M0 for ages of 5-12Gyr As is evident from 
Fig. [TO]the IMF can be approximated by a powerlaw over this 
small mass range, and the slope of this powerlaw depends on 
nic- We find 

logm,. « -0.05-1. Ixi (xi<1.3) (19) 

with xi = -dlog£^/dlogm the logarithmic slope of the IMF 
near m « IMq. This expression relates Salpeter-like IMFs 
with varying slope x, as used in the Maraston (2005) models 
and Eqs. [3]and|4] to physically more plausible Chabrier-like 
IMFs with fixed high mass slope and varying characteristic 
mass. 

The best-fitting slope of the IMF as derived from Sola r 
metallicity Maraston (2005) models is x = -0.3;!;q 7 (see § 15. 7b . 
In the context of a Chabrier-like IMF this slope implies a char- 
acteristic mass nic = ^■9'!i\ lMQ at z = 3.7;|;q^. For super-Solar 
metallicity the best-fitting values of x are strongly negative 
and may imply ^ IOM0, although this represents a sub- 
stantial extrapolation of the models. 

We conclude that the color- and luminosity evolution of 
early-type galaxies are consistent with IMFs which have a 
Salpeter slope at high masses but turn over near ~ 1 Mq. Such 
an IMF is perhaps best described as "bottom-light" rather than 
top-heavy: it does not have a larger number of massive stars 
than a standard Chabrier (2003) IMF, but has a deficit of low 
mass stars. The form of the IMF at masses < 1 Mq is not 
constrained by the data presented here, and it is an open ques- 
tion whether the slope of the IMF actually becomes negative 
at low masses or stays constant at x ^ 0. 

7.2. Evolution of the Characteristic Mass 

The evolution of the characteristic mass is shown in Fig. 
nn The results obtained in this study are represented by the 
formal best fit for Solar metallicity, as discussed in § 14.21 The 
point labeled 'SMGs' reflects results of Blain et al. (1999a) 
for submm galaxies (SMGs). Blain et al. (1999a) find that a 
standard Salpeter (1955) IMF has too many low mass stars, 
which would produce too much A'-band light at z = a similar 
argument was made by Dwek et al. (1998, based on CORE 
data). Blain et al. find that a top-heavy IMF with a simple cut- 
off at 1 Mq resolves this discrepancy. No errorbaris given, but 
we can assume that the uncertainty in the amount of "missing 
mass" is at least 50 %. The value we adopt for = 0.34, to 
match the total stellar mass of the truncated Salpeter (1955) 
IMF invoked by Blain et al. (1999a). The redshift comes from 
Chapman et al. (2003). 

The Figure also includes the IMFs of the Milky Way disk 
(from Chabrier 2003) and of globular clusters. The age for 
globular clusters reflects the results of Gratton et al. (2003), 
who find that globular clusters in the inner halo have ages 
ranging from ~ 13.4Gyr to ^ 10.8 Gyr with a systematic un- 
certainty of w 0.6 Gyr The characteristic mass was deter- 
mined by Paresce & De Marchi (2000), who infer a typical 
characteristic mass = 0.33, with formal uncertainty ±0.03. 
The true uncertainty is probably somewhat larger, as dynam- 
ical effects may play a role even at low masses (see, e.g., 
de Marchi, Paresce, & Portegies Zwart 2005). The Galac- 
tic bulge is not included; no estimates have been made for the 



characteristic mass of the bulge IMF, but we note that there is 
evidence that the IMF in the range 0.15 - 1 Mq is flatter than 
a Salpeter IMF (Zoccali et al. 2000). 

The broken lines in Fig.[TT|are examples of models in which 
the temperature of the CMB effectively sets the characteris- 
tic mass scale of star formation at high redshift (see Larson 
1998, 2005; Jappsen et al. 2005). The CMB temperature 
exceeds the typical temperatures in prestellar cores in molec- 
ular clouds of 8 K (e.g., Evans et al. 2001) beyond z ^ 2. 
As discussed in Larson (2005) the exact temperature depen- 
dence is very uncertain, and can vary from oc r'^ (Jappsen 
et al. 2005, dotted line) to oc T^-^^ (Larson 1985, dashed 
line). The normalization of these models is also very uncer- 
tain. The value of ^ 0.3 Mq is the approximate value of the 
Jeans mass in present-day cores (Larson 2005), but other as- 
pects may play a role in setting the characteristic mass at low 
redshift (see McKee & Ostriker 2007). The grey solid line is 
a non-physical "toy model" which fits the Milky Way disk, 
submm-galaxies, and cluster galaxies and is intermediate be- 
tween the two physically-motivated models at high redshift. 
This model has the form 

m2 = 0.082(l + (rcMB/6)') (20) 

and is used to explore the effects of an evolving characteristic 
mass in the following Sections. 
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Fig. 11. — Estimates of the characteristic mass of the IMF, as a 
function of the epoch of star formation. The point marked "clus- 
ter galaxies" is the best-fit Solar metallicity model to the observed 
evolution of massive early-type galaxies in clusters. Also shown are 
estimates for the present-day Milky Way disk, for globular clusters 
in the inner halo, and for submm-galaxies (SMGs). Broken lines 
are examples of models of Larson (2005), in which the characteristic 
mass is a strong function of the temperature of the microwave back- 
ground radiation at suffuciently high redshift. The dashed (dotted) 
line shows a model with a oc T '^^ (oc T'^) dependence at z > 2. The 
grey solid line is a non-physical model which fits the available data. 

It is interesting that the IMF of old globular clusters appears 
to have a lower characteristic mass than the IMF of massive 
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cluster galaxies, even though their stars probably have similar 
mean ages. Determining the IMF in globular clusters is no- 
toriously difficult, as the mass function as observed today is 
heavily influenced by dynamical effects in combination with 
stellar evolution (see, e.g., Chabrier 2003). In particular, glob- 
ular clusters with a very top-heavy IMF would not be expected 
to survive in the tidal field of our Galaxy for a Hubble time 
(e.g., Joshi, Nave, & Rasio 2001), and the clusters that are 
surviving today might not be representative for the original 
population. Nevertheless, taking the existing constraints at 
face value, it seems that other parameters than the CMB tem- 
perature (e.g., metallicity, star formation rate, or mass of the 
star forming complex) may play a role in determining nic at 
high redshift. 




2 4 6 

z 

Fig. 12. — Effects of an evolving IMF on the cosmic star formation 
history. The dashed line in (a) shows the change in the total initial 
stellar mass with redshift, if the characteristic mass nic evolves ac- 
cording to Eq.|20] The solid line shows the total mass relative to the 
mass in stars of > IOM0, which supply most of the UV luminosity, 
(b) Measurements of the volume-averaged star formation rate, from 
Hopkins (2004) and Bouwens et al. (2007). Grey points are for a 
Salpeter (1955) IMF, and black points are for a Chabrier (2003) IMF 
with evolving nic. The modified evolution shows a peak at z ~ 1.5. 

7.3. Cosmic Star Formation History 

As pointed out by others (e.g., Larson 2005; Hopkins & 
Beacom 2006; Tumlinson 2007; Fardal et al. 2007), an evolv- 
ing IMF has direct implications for the derived cosmic star 
formation history. Essentially all indicators of the star forma- 
tion rate (such as ultraviolet luminosity. Ha line luminosity, 
and infrared luminosity) measure the effects of luminous O 
and B stars, which have masses > \QMq. Inferred total star 
formation rates therefore rely on an extrapolation of the IMF 
from ~ IOMq down to ~ 0.1 Mq (see, e.g., Lilly et al. 1996; 
Madau et al. 1996; Madau, Pozzetti, & Dickinson 1998). As 



most of the total mass is in low mass stars, changes to the 
IMF in the mass range 0. 1-1 Mq will have virtually no effect 
on the observed star formation indicators but potentially large 
effects on the actual total stellar mass that is produced. 

The effects of an evolving IMF of the form of Eq. |20] are 
shown in Fig. fT2l a). The broken line shows the integral of 
Eq.[T8] i.e., the integrated stellar mass for an increasing value 
of mc with redshift. The solid line shows the integrated mass 
relative to that in stars with masses > \QMq, as the energy 
output of stars in this mass range drives the star formation 
measurements. The effects are substantial at high redshift, 
and at z ^ 6 the star formation rate could be overestimated by 
a factor of 3^ relative to z = 0. 

In Fig. [T2jb) the correction of panel (a) is applied to mea- 
surements of the star formation rate at a range of redshifts. 
The data are a combination of the extensive compilation of 
Hopkins (2004) and recent measurements at high redshift by 
Bouwens et al. (2007). Extinction-corrected values were 
used from both studies. The data as reported in the litera- 
ture are shown in grey, and the grey curve shows a simple fit 
to the points. Black points are the same data, corrected to the 
modified Chabrier (2003) IMF proposed in this paper. This 
correction is a combination of two effects: a constant offset 
to account for the difference between a Salpeter (1955) IMF 
(which is assumed in the quoted literature) and a Chabrier 
(2003) IMF, and the redshift-dependent effect shown in Fig. 
fTSl' a). As is clear from the black line, the cosmic star forma- 
tion history has a fairly well-defined peak at z ^ 1 .5 if the IMF 
depends on redshift. 

7.4. M/L Ratios 

The influence of an evolving IMF on the M/L ratios of 
galaxies is complex, as several competing effects play a role. 
The number of low mass stars with respect to high mass stars 
is reduced, which lowers the M/L ratio as these stars con- 
tribute little to the integrated light. However, for m^- > 0.4 Mq 
the number of turn-off stars is also reduced (see Fig. [TOl l. 
and these stars dominate the light at rest-frame optical wave- 
lengths. As a result, the net effect of an increased m^ on the 
M/L ratio is generally smaller than on the star formation rate. 
A further complication is that the turn-off mass can be simi- 
lar to the characteristic mass (e.g., in elliptical galaxies at low 
redshift where both values are ~ 1 Mq). This means that the 
effect on the M/L ratio is not a constant, but depends on the 
age of the population. A final complication is the mass in 
stellar remnants, which is a larger fraction of the total stel- 
lar mass for more top-heavy IMFs. As discussed by Maras- 
ton (2005) and in § 16.61 IMFs that have a powerlaw slope of 
zero from 0.1- IOOMq imply completely remnant-dominated 
mass functions at old ages, with unrealistically high M/L ra- 
tios. 

A correct treatment of these issues requires full stellar pop- 
ulation synthesis modeling, which is beyond the scope of the 
present paper Instead, we used simple stellar evolutionary 
tracks to estimate what the net outcome is of the various com- 
peting mechanisms. The Yale-Yonsei isochrones were used 
(Yi, Kim, & Demarque 2003; Demarque et al. 2004). Monte- 
Carlo simulations of 100,000 stars were generated, with Solar 
metallicity, a range of ages, and a Salpeter (1955) IMF. Next, 
the mass function was resampled to match Eq. [18] for a grid 
of values of m^. The total V band luminosity was determined 
by linearly adding the light of individual stars. The mass in 
living stars was determined by integrating Eq. [18] up to the 
turn-off mass. The mass in remnants was calculated by in- 
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tegrating Eq. [18] from the turn-off mass to IOOM0 and mul- 
tiplying by the fraction of the initial mass that is retained in 
the form of black holes, neutron stars, and white dwarfs. This 
(age-dependent) fraction was determined in the same way as 
described in Bruzual & Chariot (2003). 

The results are shown in Fig. [T3] At young ages the light 
is dominated by relatively massive stars, and the behavior is 
similar to that of the implied star formation rate (see Fig.[T2h). 
At intermediate ages and high values of nic the turn-off mass 
is similar to the characteristic mass, and the effect on the lu- 
minosity is similar to the effect on the mass. The net effect is 
a roughly constant M/L^ ratio as a function of nic- At the old- 
est ages the IMF becomes remnant-dominated, with ~ 80 % 
of the mass in remnants for an age of 1 Gyr and nic =1.5 Mq . 
Therefore, the M/Ly ratios are actually higher than for a stan- 
dai-d Chabrier (2003) IMF wih nic = 0.08 M©, approaching or 
even exceeding those implied by a Salpeter (1955) IMF. Al- 
though these results are somewhat uncertain due to the crude 
nature of our modeling, the qualitative conclusion is that the 
M/Ly ratios of galaxies are not necessarily strongly reduced 
by an evolving IMF, in contrast to their star formation rates. 
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Fig . 13. — Effects of a changing characteristic mass on the M jLy ra- 
tio for stellar populations of ages 0.1, 0.3, 0.5, 3, 5, and 10 Gyr. The 
results for higher ages are somewhat uncertain and noisy, largely 
due to numerical effects. For young ages the MjL ratio steadily 
declines with increasing characteristic mass. The behavior is more 
complex when the characteristic mass is similar to the turn-off mass. 
For mc ~ 1 Mq and old ages the mass function becomes remnant- 
dominated, and the M/L ratios approach those implied by a Salpeter 
(1955) IMF 



7.5. Evolution of the Stellar Mass Density 

In Fig. [14] we show the effects of an evolving IMF on the 
cosmic stellar mass density, in a similar way as was done in 
Fig.[T2]for the cosmic star formation history. Panel (a) shows 
the redshift dependence of the M/Ly ratio, with respect to 
the M/Lv ratio implied by a standard Chabrier (2003) IMF. 
To generate this relation it was assumed that all galaxies start 
forming stars at z = 10, and have a constant star formation 
rate until the epoch of observation. The implied mean age of 



galaxies at z = 6 is ~ 0.2 Gyr, and the mean age of galaxies at 
z = is ^6.5 Gyr These values are reasonable, and we note 
that the results are not very sensitive to the details of the star 
formation history of the galaxies. This assumption provides 
an estimate of the mean mass-weighted formation time of the 
stars, and hence through Eq. |20]the appropriate value of nic- 
The relations between age, m^, and M/Ly ratio shown in Fig. 
[T3]are then used to estimate the change in M/Ly as a function 
of redshift. 




z 



Fig. 14. — Effects of an evolving IMF on the cosmic stellar mass 
density. The broken line in (a) shows the effect on the M/Lv ratio 
with respect to a standard Chabrier (2003) IMF. Discontinuities are 
due to the discrete grid of ages used in the computation; the solid 
line is a smoothed version of the dotted line, (b) Measurements of 
the mass density from Cole et al. (2001), Dickinson et al. (2003), 
Drory et al. (2005), Rudnick et al. (2006), and Fontana et al. (2006). 
Grey symbols are for a Salpeter (1955) IMF, and black symbols are 
for the evolving IMF proposed in this paper. The grey and black lines 
show the evolution implied by the evolution of the star formation rate 
shown in Fig.[T2] The black line is a somewhat better fit to the black 
points than the grey line is to the grey points. 

Going up in redshift, the M/Ly ratio first declines due to the 
increase in nic from ^ 0.1 to ^ 0.4 Mq. The M/Ly ratio 
reaches a minimum at z 2, and then increases as the char- 
acteristic mass becomes similar to the turn-off mass. Beyond 
z ^ 4 the M/L ratio once again decreases, as the galaxies be- 
come younger and the turn-off mass increases rapidly. Mea- 
surements of the evolution of the mass density from Cole et al. 
(2001), Dickinson et al. (2003), Drory et al. (2005), Rudnick 
et al. (2006), and Fontana et al. (2006) are shown in panel (b). 
Errorbars are taken from the literature sources, and typically 
do not include systematic uncertainties. Grey points show the 
literature values, determined assuming a Salpeter (1955) IMF, 
and black points show the corrected values. The correction 
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is a constant offset (to account for the difference between a 
Salpeter (1955) IMF and a standard Chabrier (2003) IMF) in 
addition to the relation shown in panel (a). 

The grey and black lines show the evolution of the stel- 
lar mass density as implied by the observed evolution of the 
star formation rate shown in Fig. [12] Mass loss was taken 
into account using the same scheme as employed by Bruzual 
& Chariot (2003). Note that mass loss is a larger effect for 
Chabrier-like IMFs than for a Salpeter (1955) IMF, as a larger 
fraction of the total mass is in high mass stars. 

The median difference between the data points and the 
curve is a factor of 2.3 for a non-evolving IMF, and a factor of 
1 .7 for the evolving Chabrier-like IMF. Considering that this 
type of comparison has many systematic uncertainties quite 
independent of the IMF (Hopkins & Beacom 2006; Fardal 
et al. 2007), the fact that the two independent measures of 
the build-up of stellar mass in the Universe agree to within a 
factor of 2 can be considered a success. In any case, the 
discrepancy is smaller for the evolving IMF than for a non- 
evolving IMF, because the star formation rate is reduced by a 
larger fraction than the mass density. Our evolving IMF has 
a qualitatively similar effect as the "paunchy" IMF with an 
increased contribution from stars around 1.5^M0 proposed 
by Fardal et al. (2007), which was specifically designed to 
give better agreement between the star formation history of 
the Universe (as implied by the extragalactic background ra- 
diation) and the observed evolution of the mass density. 

8. SUMMARY AND CONCLUSIONS 

This paper compares the color evolution of massive cluster 
galaxies to their luminosity evolution, with the aim of con- 
straining the form of the IMF at the time when the stars in 
these galaxies were formed. It is found that the evolution 
of the rest-frame U -V color is not consistent with the pre- 
viously determined evolution of the rest-frame M /Lb ratio, 
unless the IMF slope is significantly flatter than the Salpeter 
value around 1 M©. For standard IMFs with a slope of 1.3 at 
m > I Mq the luminosity evolution is too fast for the mea- 
sured color evolution, and the implied stellar ages derived 
from M/L evolution and color evolution are not consistent 
with each other. The only models that are able to fit the 
color evolution and the luminosity evolution simultaneously 
have IMF slopes of ^ around 1 Mq and mean luminosity- 
weighted stellar formation redshifts of ^ 4 (for Solar metal- 
licity). 

This result is somewhat uncertain, as the currently available 
sample of cluster galaxies with accurate rest-frame U-V col- 
ors and dynamical masses is somewhat limited and there are 
many systematic effects which may play a role. In particu- 
lar, it is an open question whether stellar population synthesis 
models are able to predict color evolution with the required 
accuracy. The commonly used Bruzual & Chariot (2003) and 
Maraston (2005) models give broadly similar answers, but 
that may be because they share many of the same assump- 
tions. 

As discussed in §|6]the higher stellar ages implied by a flat 
IMF are consistent with many other studies, which lends some 
credibility to the results presented here. Of particular impor- 
tance is the agreement with the data on Balmer line strengths 
of Kelson et al. (2001), as they do not suffer from the same 
systematic uncertainties as the color data. Formation redshifts 
substantially larger than two also fit more comfortably with 
the direct detection of old galaxies at high redshifts. A firm 
independent measurement of the star formation epoch of mas- 



sive cluster galaxies, combined with a better understanding of 
selection effects at high redshift, would leave the IMF as the 
only free parameter and greatly simplify the problem. 

The implications discussed in §|7] are obviously somewhat 
speculative. Although the interpretation in terms of an evolv- 
ing characteristic mass is physically plausible according to 
some models (e.g., Larson 2005), many other forms of the 
IMF are consistent with the data. The observations described 
in this paper are only sensitive to a narrow mass range near 
1 Mq, and the IMF proposed in Eq. [T8]represents a very sub- 
stantial extrapolation. This is illustrated in Fig. [15] both the 
top-heavy IMF (red line) and the "bottom-light" IMF (green 
line) are consistent with the data presented in this paper. The 
main reason for preferring the bottom-light IMF over a top- 
heavy form is that the absolute M/L ratios of galaxies are 
within a reasonable range. As shown in Fig.[T3]the M/Ly ra- 
tios are similar to those implied by a standard Chabrier (2003) 
IMF, which means that they are consistent with dynamical 
measurements at z = (Cappellari et al. 2006). 
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Fig. 15. — Illustration of the key results of this paper. In §4 the 
slope of the IMF of massive cluster galaxies x is estimated to be 
approximately -0.3 in a narrow region around 1 Mq (thick black 
line). The red dashed line and the green solid line show two possible 
interpretations: a global change of the slope of the IMF (with respect 
to a standard Chabrier or Salpeter IMF) at all masses (red dashed 
line), or a change in the characteristic mass (solid green line). The 
"top-heavy" interpretation is inconsistent with the dynamical M/L 
ratios of nearby elliptical galaxies (see § 16. 6t , whereas the "bottom- 
light" interpretation is consistent with all data that we are aware of. 
The blue and yellow areas illustrate that stars with masses > IOMq 
drive star formation measurements, whereas stars with masses 1 - 
5Mq drive M/L measurements (see § 7.3-7.5). 

An "unintended" effect of an evolving IMF of the form pro- 
posed here is that it reduces the discrepancy between the ob- 
served stellar mass density and the density implied by the cos- 
mic star formation history. This result is in excellent (albeit 
qualitative) agreement with several other recent studies (e.g., 
Hopkins & Beacom 2006, Fardal et al. 2007, Perez-Gonzalez 
et al. 2007; Wilkins, Trentham, & Hopkins 2007; Dave 2007; 
see also, e.g.. Fields 1999). The differences between a non- 
evolving IMF and an evolving IMF are fairly large at z ^ 4 (as 
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the effect on the star formation rate is strong and the effect on 
M/L ratios is weak at that redshift), and it will be interesting 
to see where future measurements of the mass density at high 
redshift will fall in Fig. [14] We note that the effects on the 
MjLy ratios are somewhat uncertain, as they rely on rather 
rudimentary stellar population synthesis modeling. The ef- 
fects on star formation rates are more robust, and suggest that 
the cosmic star formation rate peaked at z ^ 1.5. 

This paper adds to previous theoretically and observation- 
ally motivated suggestions that the IMF may evolve with red- 
shift (e.g., Worthey et al. 1992; Larson 1998, 2005; Fields 
1999; Blain et al. 1999a; Baugh et al. 2005; Stanway et al. 
2005; Hopkins & Beacom 2006; Tumlinson 2007; Lacey et 
al. 2007; Fardal et al. 2007; Perez-Gonzalez et al. 2007). Al- 
though these studies vary greatly in their parameterization of 
IMF evolution and the range of stellar masses that are con- 
sidered, they all suggest that the ratio of high-mass stars to 
low-mass stars was higher in the past. It should be pointed 
out that most of these papers invoke a change in the IMF as 
a "last resort" possibility, to explain data that are otherwise 
difficult to interpret. In the present study a different approach 
was followed, in that we set out with the specific purpose of 
constraining the slope of the IMF. An advantage of the ap- 
plied method is that it is fairly direct, as the rate of luminosity 
evolution is determined by the number of stars as a function 
of mass. Disadvantages are that it is only sensitive to a very 
limited mass range (see Fig.fTSll; that it relies on stellar pop- 
ulation synthesis models, which are not well calibrated in the 
relevant parameter range; that the progenitors of early-type 
galaxies may not be representative for the general population 
of high redshift galaxies; and that the currently available data 
are somewhat limited. 

Accepting the possibility of an evolving characteristic 
mass, it is interesting to speculate what could be the cause, 
or causes. The proximate cause may well be a higher tem- 
perature in molecular clouds at high redshift, which would 
raise the Jeans mass and could inhibit the formation of low 
mass stars (Larson 1998, 2005). The ultimate cause could be 
the higher temperature of the cosmic microwave background. 
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the fact that star formation tends to proceed in more extreme 
environments at higher redshift, or a combination. Available 
information on IR-bright galaxies suggests that dust tempera- 
tures in star burst galaxies are of order 30-40 K (Dunne et al. 
2000; Chapman et al. 2005), and hence exceed the CMB tem- 
perature for all relevant redshifts. However, it is as yet un- 
clear what fraction of the total star formation has taken place 
in these extreme environments (see, e.g., Reddy et al. 2007). 

The analysis presented here can be improved in various 
ways. The number of clusters with accurate rest-frame V -Y 
is currently smaller than the number of clusters with accurate 
M /Lb measurements, and this can be remedied by obtaining 
accurate (space-based) photometry in well-chosen filters of 
the remaining clusters in the vv07 sample. It is also important 
to measure the evolution in a redder rest-frame color, such as 
V -I. Redder color suffer less from possible contributions of 
hot stars, and their evolution is probably somewhat better cal- 
ibrated in stellar population synthesis models. This requires 
very accurate photometry in the near-infrared, which should 
be possible with WFC3 on HST. On the modeling side, it 
would be helpful to implement more variations of the IMF in 
stellar population synthesis codes than the standard Salpeter, 
Kroupa, and Chabrier forms. Ultimately it may be fruitful 
(or prove necessary) to have the characteristic mass, or some 
other parameter describing the form of the IMF, as one of the 
"standard" parameters in these models, on a par with the age 
and metallicity. 
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APPENDIX 
APERTURE CORRECTIONS 

For five of the seven distant clusters discussed in this Paper colors were measured directly from HST images in apertures of 
fixed physical size. For each cluster this size was chosen to match an 11" diameter aperture at the distance of the Coma cluster, 
allowing a direct comparison to data from BLE92 and to Mcintosh et al. (2005). For the clusters MS 1054-03 (z = 0.831) and 
RX J0152-13 (z = 0.837) color measurements were taken from Blakeslee et al. (2006), who used apertures scaled to contain 50 % 
of each galaxy's light. In this Section we give the corrections that were applied to bring the Blakeslee et al. (2006) data onto our 
system, and test these corrections using data from Mcintosh et al. (2005). 

The reason why aperture corrections need to be applied even when data are corrected for seeing effects is that early-type 
galaxies have color gradients. These gradients do not vary much from galaxy to galaxy, when expressed as A{Fi -F2)/ A{logr), 
with Fi and F2 the relevant filters and logr the logarithm of the circularized radial distance to the center of the galaxy (see, e.g., 
Franx, Illingworth, & Heckman 1989; Franx & Illingworth 1990; Peletier et al. 1990). The measured color within an aperture r^p 
is not simply the integral of the color gradient, as the radial dependence of the luminosity has to be taken into account as well. 

We determine the expected dependence of color on aperture size numerically, assuming an r'/"* law for the radial luminosity 
dependence. The solid line in Fig. lAll shows the expected relation for a color gradient of -0.15, appropriate for U -V (e.g., 
Peletier et al. 1990). The long-dashed line is a polynomial fit to the numerically-derived relation, of the form 

(Fi -F2),p = (Fi -F2)eft+6.67^^--^ (-0.1181og (2r,/D,p) -0.042 (log (2r,/D,p))') , (Al) 

with {Fi -F2)ap the color measured through an aperture of diameter Dap, {Fi -F2)eff the color measured through an aperture 
containing 50 % of the galaxy's luminosity, the half-light radius, and A(Fi -F2)/ Alogr = -0.15 the color gradient in mag per 
dex. The long-dashed line is a very good match to the (exact) solid line. For reference, short-dashed lines show the behavior of 
Eq. lAll for color gradients of -0.10 and -0.20. The data points show binned data from Mcintosh et al. (2005), who measured 
U -V colors of three nearby clusters in apertures of fixed size and in apertures of diameter Ir^. The data follow the predicted 
relation very closely, with an rms of ^ 0.01 and no apparent systematic deviations. 




2 r /D 

e ' op 

Fig . A 1 . — Effect of color gradients on the measured color within an aperture. The solid line shows the expected difference between the color 
measured within aperture Dap and the color measured within an aperture containing 50 % of the galaxy's light, assuming an r^^* law and a color 
gradient of -0.15 mag per dex. Datapoints are from Mcintosh et al. (2005); they follow the predicted curve very well. Broken lines are simple 
polynomial fits to the predicted evolution for color gradients of -0. 10, -0. 15, and -0.20. 

The datapoints from Blakeslee et al. (2006) were corrected to our fiducial aperture size by calculating r^/D^p for each galaxy 
(using the information supplied in Blakeslee et al.) and applying Eq. lAll As on average Dap ~ Ir^ for our aperture size the 
correction is only ^ 0.015 in the mean for both clusters, with an object-to-object scatter of 0.03. Galaxies requiring a correction 
> 0.05 or < -0.05 were removed from the sample; this step slightly reduces the scatter in the color-mass relations of the clusters 
but otherwise has a neghgible effect on our results. 



